Defensive Programming: How to (Help) Shield Your Code From Error

Michael Bertolacci
michael_bertolacci@uow.edu.au
Centre for Environmental Informatics, UOW, Australia

2021-02-17

Who I am

A postdoc at the University of Wollongong, working on applied spatio-temporal statistical problems with Noel Cressie and Andrew Zammit-Mangion.

Before that, I was a PhD student at the University of Western Australia.

Before that, I was a software engineer for 10 years.

So, just some quick background on me: I am a postdoc at the University of Wollongong, where I work with Noel Cressie and Andrew Zammit-Mangion, whom you will hear from at various times during this symposium.

Before that I was a PhD student at UWA.

But before that, I was a software engineer for 10 years. And really this talk is about software engineering. These days as students and practitioners in spatio-temporal statistics we spend a lot of time programming. There are a lot of lessons to be learned from the field of software engineering, and I want to share some of those.

In a way this is a bit tangential to the topic of the symposium, but in another way it is not. That’s because the computational burden is relatively high in spatio-temporal statistics compared to other sub-disciplines, so the code we write is pretty complicated sometimes.

Assumed programming knowledge

You have programmed before.
You can read basic R or Python.

Motivation

A meme

There are several things that can (should?) keep a statistician from falling asleep at night.

One of those is Simpson’s paradox:

Another is having bugs in your code.

Story time

Reinhart, C. M.; Rogoff, K. S. (2010). Growth in a Time of Debt. American Economic Review. 100 (2): 573–78. doi:10.1257/aer.100.2.573.

Their conclusions led several governments to implement financial austerity programs.

A student reproducing the work found that they’d made an error in a formula in their Excel spreadsheet, invalidating some of their empirical claims.

Story time

My first research project…

The first time I ever did research was in computer science, back in my undergrad.
I was trying to reproduce the results of a published paper. The specific topic doesn’t matter. I had implemented their methods, and some of the numbers I was getting in a simulation study were different from the authors’.
So I asked the authors for their own code so I could compare. They kindly obliged, and it turned out they had a bug in one of their methods. In fact I had made the same error in the same method earlier but, luckily, I’d noticed it. They weren’t so lucky.
Perhaps it doesn’t matter in the grand scheme of things, but one part of their results was wrong, a little blip in science. I did tell them, but nothing really came of it (I’m not blaming them) and I left it.
But it’s just sad, really, to have a little bit of science wrong. I have seen this again since.
In all cases the work was peer-reviewed.
How does this happen?

Peer review - the crown jewel of science

We peer review each other’s work prior to publication.

Keeps us honest
Encourages rigour
Ensures the work is useful
Catches errors

But it generally only applies to the written part of the work. If the numbers reported look plausible, there’s (usually) an assumption that they are computed correctly.

Code is a blind spot for peer review.

Bugs aren’t just a problem in academic work. Professional software developers invest huge effort into avoiding and fixing bugs.

Some gold standard strategies the best teams use:

Peer review: software is not released until the code has been read by multiple people
Dedicated testing teams: you can get paid to break code, all day! https://www.seek.com.au/software-tester-jobs

Most academics don’t get to use these strategies…

So bugs should be a huge concern in academic work.

But they are also a concern in professional software work. Software engineers invest huge effort into avoiding and fixing bugs.

There are lots and lots of strategies. But two gold-standard strategies used by the best teams are peer review and dedicated testing teams.

In peer review, the software is not released until the code has been read by multiple people. Every time a change is made to the code, that is reviewed too. It’s similar to academic peer review.

Lots of companies have dedicated testing teams. People actually get paid to break code, all day, every day! They don’t even have to write the code.

Unfortunately, these are not an option for most academics. Most of the code is worked on by one person and no-one else looks at it.

There are some strategies that an individual programmer can use.

We’ll cover two:

Strategy 1: Make your code easier to understand
Strategy 2: Defensive programming

The main one we won’t cover is automated testing.

Every bug can be traced back to a wrong assumption.

The person who wrote the code made the assumption.
⇒ You wrote the code.
⇒ You caused the bug :(

What you need are ways to:

Make your assumptions more obvious so you avoid making wrong ones (Strategy 1).
Make it so you know when an assumption is violated so you can find and fix the error (Strategy 2).

Strategy 1: Make your code easier to understand

Code that is hard to read is hard to understand.

You should strive to make your code clean and readable.

This helps:

You now, when you debug the code
You in the future when you read the code
Everyone who reads the code

The DRY principle

Don’t Repeat Yourself.

\[ \textrm{RMS}(x_1, \ldots, x_n) = \sqrt{\frac{1}{n} \sum_{i = 1}^n x_i^2} \]

Repetitive:

rms1 = np.sqrt(np.mean(
  np.square(error1)
))
rms2 = np.sqrt(np.mean(
  np.square(error2)
))

With a function:

def rms(x):
    '''Return the root mean square of
    the vector x'''
    return np.sqrt(np.mean(
      np.square(x)
    ))


rms1 = rms(error1)
rms2 = rms(error2)

Follow a style guide

Style guides list rules to follow when laying out your code.

You should start from a publicly available guide.

For R, you can follow the Tidyverse style guide [1].

For Python, the standard is called PEP 8 [2].

You can tweak the guidelines to suit you, but not following some style guide is like writing a paper with inconsistent grammar, or using two different citation styles in the same document (Bertolacci, 2020).

R style example

Non-compliant:

y <- x*2
if(y > 2) {
  z <- y + 2
}
sqrt_z <- sqrt (z)

Compliant:

y <- x * 2
if (y > 2) {
  z <- y + 2
}
sqrt_z <- sqrt(z)

The R package lintr [3] can check your code automatically:

example.R:1:7: style: Put spaces around all infix operators.
y <- x*2
     ~^~
example.R:2:3: style: Place a space before left parenthesis, except in a function call.
if(y > 2) {
  ^
example.R:5:11: style: Remove spaces before the left parenthesis in a function call.
sqrt_z <- sqrt (z)

Python style example

Non-compliant:

y = x*2
if y> 2:
   z = y + 2
sqrt_z = sqrt (z)

Compliant:

y = x * 2
if y > 2:
    z = y + 2
sqrt_z = sqrt(z)

The program pycodestyle [4] checks if you are following the rules:

example.py:2:5: E225 missing whitespace around operator
example.py:3:4: E111 indentation is not a multiple of four
example.py:4:14: E211 whitespace before '('

Sometimes you knew what the code did when you wrote it, but a few months later…

loc<-c(137.5, -4.7)
loc2<-c(77.5, 18.1)
dat<-read.csv('locations.csv')
d=2*3389.5*asin(sqrt(
sin((dat$lat*pi/180-loc[2]*pi/180)/2)^2
+cos(dat$lat*pi/180)*cos(loc[2]*pi/180)
*sin((dat$lon*pi/180-loc[1]*pi/180)/2)^2))
d2=2*3389.5*asin(sqrt(
sin((dat$lat*pi/180-loc2[2]*pi/180)/2)^2
+cos(dat$lat*pi/180)*cos(loc2[2]*pi/180)
*sin((dat$lon*pi/180-loc2[1]*pi/180)/2)^2))
#dat$nrst=-1
dat$nrst=0
for(i in 1:nrow(dat)){
if(d[i]<d2[i])
dat$nrst[i]<-1
else
dat$nrst[i]<-2
}

MARS_RADIUS <- 3389.5  # in kilometres

hav_deg <- function(x) sin(x * pi / 360) ^ 2
cos_deg <- function(x) cos(x * pi / 180)
mars_haversine_dist <- function(locations, origin) {
  with(locations, {
    2 * MARS_RADIUS * asin(sqrt(
      hav_deg(latitude - origin['latitude'])
      + cos_deg(latitude) * cos_deg(origin['latitude'])
      * hav_deg(longitude - origin['longitude'])
    ))
  })
}

curiosity_location <- c(longitude = 137.5, latitude = -4.7)
perseverance_location <- c(longitude = 77.5, latitude = 18.1)

locations <- read.csv('locations.csv')
curiosity_dist <- mars_haversine_dist(locations, curiosity_location)
perseverance_dist <- mars_haversine_dist(locations, perseverance_location)

locations$nearest_rover <- 'none'
for (i in 1 : nrow(locations)) {
  if (curiosity_dist[i] < perseverance_dist[i]) {
    locations$nearest_rover[i] <- 'curiosity'
  } else {
    locations$nearest_rover[i] <- 'perseverance'
  }
}

Now here’s a cleaned-up version. Luckily I wrote this program a few days ago, not months ago, so I know what it does.

Well, the weird number now has a name: MARS_RADIUS. That’s a good hint.

Now there’s a function called mars_haversine_dist. Ahhh, so that computes a distance on Mars using the haversine formula. The repetition is gone for the conversions to radians too.

Now the location variables have better names, curiosity_location and perseverance_location. These are the locations of two Mars rovers. Actually Perseverance isn’t on Mars yet, but it’s going to land here.

And now if we scroll down and look at the end, it might be a bit clearer that we have a table of locations ON MARS, and we are finding the distance between these locations and the rovers, then finding the closest rover.

All this is relatively easy to figure out because the code uses nice names for everything, doesn’t repeat itself, and has a logical layout. If you read this code years from now, there’s a good chance you will understand its purpose, at least partly.
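
For readers who work in Python, the same clean-up might look something like the sketch below. It is an illustration rather than a translation of the slides: it uses NumPy arrays instead of a data frame, the names haversine_deg, nearest_rover and ROVER_LOCATIONS are made up for this example, and the three test locations are invented.

```python
import numpy as np

MARS_RADIUS_KM = 3389.5

# (longitude, latitude) in degrees, as in the R example above
ROVER_LOCATIONS = {
    'curiosity': (137.5, -4.7),
    'perseverance': (77.5, 18.1),
}


def haversine_deg(angle):
    '''Return the haversine of an angle given in degrees'''
    return np.sin(np.radians(angle) / 2) ** 2


def mars_haversine_dist(longitude, latitude, origin):
    '''Return great-circle distances in kilometres on Mars from each
    (longitude, latitude) pair to origin, all given in degrees'''
    origin_longitude, origin_latitude = origin
    return 2 * MARS_RADIUS_KM * np.arcsin(np.sqrt(
        haversine_deg(latitude - origin_latitude)
        + np.cos(np.radians(latitude)) * np.cos(np.radians(origin_latitude))
        * haversine_deg(longitude - origin_longitude)
    ))


def nearest_rover(longitude, latitude):
    '''Return an array naming the nearest rover to each location'''
    names = list(ROVER_LOCATIONS)
    # One row of distances per rover, one column per location
    distances = np.stack([
        mars_haversine_dist(longitude, latitude, ROVER_LOCATIONS[name])
        for name in names
    ])
    return np.array(names)[np.argmin(distances, axis=0)]


# Made-up locations, purely for illustration
longitude = np.array([137.0, 78.0, 100.0])
latitude = np.array([-5.0, 18.0, 0.0])
print(nearest_rover(longitude, latitude))
```

Keeping the rovers in a dictionary means a third rover could be added without touching the distance code, which is the DRY principle again.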

More clean code tips

Code for correctness and readability first.
Comments are great, but it’s better if the code is obvious enough not to need comments.
Shorter isn’t necessarily better—short programs don’t run any faster!
Tab completion in your editor means you don’t have to type longer function and variable names every time.
Picking good names for things is hard, but it’s worth the effort.

Strategy 2: Defensive programming…

…or, dead programs tell no lies [5].

A tale of two errors…

n <- length(y)
sigma <- 1
log_likelihood <- (
  - n * log(2 * pi) / 2
  - n * log(sigma)
  - sum(y ^ 2 / sigma ^ 2) / 2
)
bic <- log(n) - 2 * log_likelihood
print(bic)

## [1] NA

No explicit error, but you probably don’t want that value to be NA. What’s the root cause?

dat <- data.frame(x = 1 : 10)
dat$y <- sin(df$x)
fit <- lm(y ~ x, data = dat)
print(coef(fit))
plot(dat$x, fitted(dat$y))

## Error in df$x: object of type
##   'closure' is not subsettable

Explicit error; annoying, but offending line is obvious.

So I’ve put two R programs here, one on the left, one on the right. On the left it’s calculating a Gaussian log likelihood of some vector y with standard deviation sigma, then a BIC value. And look, we got the dreaded NA answer, which is typically a missing value.

There’s no explicit error here, no line failed. So you have to figure out the root cause. Luckily at least we noticed the NA.

Now on the right, there’s code with another problem. And instead of a bad answer, the code threw an error. In fact a classic R error. Now in a lot of ways this is a much nicer error than on the left because you know the offending line immediately. It threw an error.

This is what is meant by dead programs tell no lies. Code that actually stops because of an error is better than code that continues to run and gives the wrong answer.

Dead programs tell no lies

Defensive programming can be used when you’re aware you’re making an assumption. The idea is to intentionally throw an error when an assumption is found to be false.

Most languages have functions to help you do this.

This way, you don’t have to search for the offending line.

Example: the exponential covariance function,

\[ C(\mathbf{x}, \mathbf{x}') \equiv \exp\left( -\frac{||\mathbf{x} - \mathbf{x}'||}{l} \right) \]

where \(l > 0\).

In R, you can use the stopifnot function:

# Given a vector or matrix x, return the covariance matrix based on the
# exponential covariance function
cov_exponential <- function(x, length_scale) {
  stopifnot(length_scale > 0)
  exp(-fields::rdist(x) / length_scale)
}

cov_exponential(y, -1)

## Error in cov_exponential(y, -1): length_scale > 0 is not TRUE

Here’s a statistical example in R. At the top I’ve given the definition of the exponential covariance function, between two points x and x prime. One of the things you will notice is that it has one parameter, l, called the length scale, and it must be greater than zero.

And below is some R code that implements it, as the comment says. The first line of the function cov_exponential calls an R function named stopifnot. This function will throw an error if the expression inside it is false.

The error is very informative, as you can see. Now if you notice, without this stopifnot expression, the function would blindly accept a negative length scale and return a result. But you know up front that the length scale should be positive. So if you ever accidentally pass in a number that’s negative, you’ll find out straight away.

In Python, you can use the assert keyword:

import numpy as np

def cov_exponential(x, length_scale):
    '''Given a vector or matrix x, return the covariance matrix based on the
    exponential covariance function'''
    assert length_scale > 0, 'length_scale must be positive'
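    # Pairwise Euclidean distances between the rows of x (a vector is
    # treated as n one-dimensional points)
    x = np.reshape(x, (len(x), -1))
    distances = np.linalg.norm(x[:, None, :] - x[None, :, :], axis=-1)
    return np.exp(-distances / length_scale)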

cov_exponential(y, -1)

Traceback (most recent call last):
  File "example.py", line 7, in <module>
    cov_exponential(y, -1)
  File "example.py", line 4, in cov_exponential
    assert length_scale > 0, 'length_scale must be positive'
AssertionError: length_scale must be positive
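
One caveat worth knowing: Python skips assert statements entirely when it is run with the -O flag, so a check that must always run can raise an exception explicitly instead. The sketch below shows one way to do that. The helper name check_positive is hypothetical, not something from the talk or from a library.

```python
def check_positive(value, name):
    '''Raise a ValueError unless value is strictly positive'''
    # Written as "not (value > 0)" so that NaN, which fails every
    # comparison, is rejected as well
    if not (value > 0):
        raise ValueError(f'{name} must be positive, got {value!r}')


check_positive(2.5, 'length_scale')  # passes silently

try:
    check_positive(-1, 'length_scale')
except ValueError as error:
    print(error)
```

For quick research scripts assert is usually fine. The explicit version is mainly useful in code that other people will run in ways you can't predict.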

Back to the tale of two errors

stopifnot(all(!is.na(y)))
n <- length(y)
sigma <- 1
log_likelihood <- (
  - n * log(2 * pi) / 2
  - n * log(sigma)
  - sum(y ^ 2 / sigma ^ 2) / 2
)
bic <- log(n) - 2 * log_likelihood
print(bic)

## Error: all(!is.na(y)) is not TRUE
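
The same guard carries over to Python, where NumPy would otherwise let a NaN flow through the arithmetic without complaint. This is a minimal sketch, assuming y is a NumPy array. The function name gaussian_bic and the sample data are invented for this example, and the formula simply mirrors the R code above.

```python
import numpy as np


def gaussian_bic(y, sigma=1.0):
    '''Return the BIC computed as in the R example above, for a
    zero-mean Gaussian with standard deviation sigma'''
    assert not np.any(np.isnan(y)), 'y must not contain NaN'
    n = len(y)
    log_likelihood = (
        - n * np.log(2 * np.pi) / 2
        - n * np.log(sigma)
        - np.sum(y ** 2 / sigma ** 2) / 2
    )
    return np.log(n) - 2 * log_likelihood


print(gaussian_bic(np.array([0.3, -1.2, 0.8])))

# Without the assert this call would quietly return nan; with it, the
# program stops at the line that states the violated assumption
try:
    gaussian_bic(np.array([0.3, np.nan, 0.8]))
except AssertionError as error:
    print('AssertionError:', error)
```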

More defensive programming tips

You won’t always know straight away which assumptions you’re making implicitly.
But when your code breaks, add a stopifnot or assert to catch it next time.
Don’t feel the need to check everything, just the most important parts.
In R, the assertthat package [6] can help.

Other strategies

Automated testing. See:
- Wikipedia on unit testing [7]
- The testthat package in R [8]
- The pytest framework for Python [9]
Not reinventing the wheel. If a mature package has a function you need, use that rather than write your own.
Internal peer review:
- Can a coauthor or a friend read your code for you?
Automated checking for common bugs:
- The pylint program can do this for Python [10].
- The lintr package does a little bit of this [3].
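
As a small taste of automated testing, here is what a test for the rms function from earlier might look like. This is a sketch, not a full introduction: the test names and inputs are illustrative, and the functions follow the test_ naming convention so that pytest would collect them if the code were saved in a file such as test_rms.py (a hypothetical file name).

```python
import numpy as np


def rms(x):
    '''Return the root mean square of the vector x'''
    return np.sqrt(np.mean(np.square(x)))


def test_rms_of_constant_magnitude_vector():
    # Every entry has magnitude 2, so the root mean square is 2
    assert np.isclose(rms(np.array([2.0, -2.0, 2.0])), 2.0)


def test_rms_known_value():
    # sqrt((3^2 + 4^2) / 2) = sqrt(12.5)
    assert np.isclose(rms(np.array([3.0, 4.0])), np.sqrt(12.5))


# pytest would find and run these automatically; calling them by hand
# also works
test_rms_of_constant_magnitude_vector()
test_rms_known_value()
```

Each test states an assumption about what the function should return, so in spirit it is defensive programming applied from the outside.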

Conclusion

Bugs can taint otherwise great research, and peer review (probably) won’t catch them.

Some easy-to-follow strategies can help you avoid some bugs. Practice makes perfect!

Sadly, though, there will always be more bugs. If you find a way to avoid them all, please tell me!

Thanks everyone!

Resources

[1] Wickham, H. (2021). The tidyverse Style Guide. Published online at https://style.tidyverse.org/.
[2] van Rossum, G., Warsaw, B., and Coghlan, N. PEP 8 – Style Guide for Python Code. https://www.python.org/dev/peps/pep-0008/, last accessed 2021-02-16.
[3] Hester, J., Angly, F., and Hyde, R. (2020). lintr: A ‘Linter’ for R Code. R package version 2.0.1. https://CRAN.R-project.org/package=lintr.
[4] Rocholl, J., Lee, I., and other contributors. pycodestyle: Python Style Guide Checker. https://pypi.org/project/pycodestyle/.
[5] Thomas, D. and Hunt, A. (2019). The Pragmatic Programmer: Your Journey to Mastery (20th anniversary ed.). Addison-Wesley Professional, Boston, MA.
[6] Wickham, H. (2019). assertthat: Easy Pre and Post Assertions. R package version 0.2.1. https://CRAN.R-project.org/package=assertthat.
[7] Wikipedia contributors (2021, January 23). Unit Testing. In Wikipedia, The Free Encyclopedia. Retrieved 00:49, February 17, 2021, from https://en.wikipedia.org/w/index.php?title=Unit_testing&oldid=1002146431
[8] Wickham, H. and Bryan, J. (2021). R Packages (2nd ed.). Published online at https://r-pkgs.org/index.html.
[9] Krekel, H., and other contributors (2021). pytest. https://docs.pytest.org/en/stable/.
[10] Logilab (2021). pylint. https://www.pylint.org/.
