Testing

Testing

Over decades of software development,people have discovered one truth:Untested software rarely works.(Many people would go as far as saying: “Most tested software doesn’t work either.”But we are all optimists here, right?)So, to ensure that your program does what you expect it to do,it is wise to test it.

One easy way to do that isto write a README filethat describes what your program should do.And when you feel ready to make a new release,go through the README and ensure thatthe behavior is still as expected.You can make this a more rigorous exerciseby also writing down how your program should react to erroneous inputs.

Here’s another fancy idea:Write that README before you write the code.

Aside:Have a look attest-driven development (TDD)if you haven’t heard of it.

Automated testing

Now, this is all fine and dandy,but doing all of this manually?That can take a lot of time.At the same time,many people have come to enjoy telling computers to do things for them.Let’s talk about how to automate these tests.

Rust has a built-in test framework,so let’s start by writing a first test:

#[test]
fn check_answer_validity() {
    assert_eq!(answer(), 42);
}

You can put this snippet of code in pretty much any fileand cargo test will findand run it.The key here is the #[test] attribute.It allows the build system to discover such functionsand run them as tests,verifying that they don’t panic.

Exercise for the reader:Make this test work.

You should end up with output like the following:

running 1 test
test check_answer_validity ... ok
test result: ok. 1 passed; 0 failed; 0 ignored; 0 measured; 0 filtered out

Now that we’ve seen how we can write tests,we still need to figure out what to test.As you’ve seen it’s fairly easy to write assertionsfor functions.But a CLI application is often more than one function!Worse, it often deals with user input,reads files,and writes output.

Making your code testable

There are two complementary approaches to testing functionality:Testing the small units that you build your complete application from,these are called “unit tests”.There is also testing the final application “from the outside”called “black box tests” or “integration tests”.Let’s begin with the first one.

To figure out what we should test,let’s see what our program features are.Mainly, grrs is supposed to print out the lines that match a given pattern.So, let’s write unit tests for exactly this:We want to ensure that our most important piece of logic works,and we want to do it in a way that is not dependenton any of the setup code we have around it(that deals with CLI arguments, for example).

Going back to our first implementation of grrs,we added this block of code to the main function:

// ...
for line in content.lines() {
    if line.contains(&args.pattern) {
        println!("{}", line);
    }
}

Sadly, this is not very easy to test.First of all, it’s in the main function, so we can’t easily call it.This is easily fixed by moving this piece of code into a function:


#![allow(unused_variables)]
fn main() {
fn find_matches(content: &str, pattern: &str) {
    for line in content.lines() {
        if line.contains(pattern) {
            println!("{}", line);
        }
    }
}
}

Now we can call this function in our test,and see what its output is:

#[test]
fn find_a_match() {
    find_matches("lorem ipsum\ndolor sit amet", "lorem");
    assert_eq!( // uhhhh

Or… can we?Right now, find_matches prints directly to stdout, i.e., the terminal.We can’t easily capture this in a test!This is a problem that often comes upwhen writing tests after the implementation:We have written a function that is firmly integratedin the context it is used in.

Note:This is totally fine when writing small CLI applications.There’s no need to make everything testable!It is important to think aboutwhich parts of your code you might want to write unit tests for, however.While we’ll see that it’s easy to change this function to be testable,this is not always the case.

Alright, how can we make this testable?We’ll need to capture the output somehow.Rust’s standard library has some neat abstractionsfor dealing with I/O (input/output)and we’ll make use of one called std::io::Write.This is a trait that abstracts over things we can write to,which includes strings but also stdout.

If this is the first time you’ve heard “trait”in the context of Rust,you are in for a treat.Traits are one of the most powerful features of Rust.You can think of them like interfaces in Java,or type classes in Haskell(whatever you are more familiar with).They allow you to abstract over behaviorthat can be shared by different types.Code that uses traits canexpress ideas in very generic and flexible ways.This means it can also get difficult to read, though.Don’t let that intimidate you:Even people who have used Rust for yearsdon’t always get what generic code does immediately.In that case,it helps to think of concrete uses.For example,in our case,the behavior that we abstract over is “write to it”.Examples for the types that implement (”impl”) itinclude:The terminal’s standard output,files,a buffer in memory,or TCP network connections.(Scroll down in the documentation for std::io::Writeto see a list of “Implementors”.)

With that knowledge,let’s change our function to accept a third parameter.It should be of any type that implements Write.This way,we can then supply a simple stringin our testsand make assertions on it.Here is how we can write this version of find_matches:

fn find_matches(content: &str, pattern: &str, mut writer: impl std::io::Write) {
    for line in content.lines() {
        if line.contains(pattern) {
            writeln!(writer, "{}", line);
        }
    }
}

The new parameter is mut writer,i.e., a mutable thing we call “writer”.Its type is impl std::io::Write,which you can read as“a placeholder for any type that implements the Write trait”.Also note how wereplaced the println!(…)we used earlierwith writeln!(writer, …).println! works the same as writeln!but always uses standard output.

Now we can test for the output:

#[test]
fn find_a_match() {
    let mut result = Vec::new();
    find_matches("lorem ipsum\ndolor sit amet", "lorem", &mut result);
    assert_eq!(result, b"lorem ipsum\n");
}

To now use this in our application code,we have to change the call to find_matches in mainby adding &mut std::io::stdout() as the third parameter.Here’s an example of a main functionthat builds on what we’ve seen in the previous chaptersand uses our extracted find_matches function:

fn main() -> Result<(), ExitFailure> {
    let args = Cli::from_args();
    let content = std::fs::read_to_string(&args.path)
        .with_context(|_| format!("could not read file `{}`", args.path.display()))?;
    find_matches(&content, &args.pattern, &mut std::io::stdout());
    Ok(())
}

Note:Since stdout expects bytes (not strings),we use std::io::Write instead of std::fmt::Write.As a result,we give an empty vector as “writer” in our tests(its type will be inferred to Vec<u8>),in the assert_eq! we use a b"foo".(The b prefix makes this a _byte string literal_so its type is going to be &[u8] instead of &str).

Note:We could also make this function return a String,but that would change its behavior.Instead of writing to the terminal directly,it would then collect everything into a string,and dump all the results in one go at the end.

Exercise for the reader:writeln! returns an io::Resultbecause writing can fail,for example when the buffer is full and cannot be expanded.Add error handling to find_matches.

We’ve just seen how to make this piece of code easily testable.We have

identified one of the core pieces of our application,
put it into its own function,
and made it more flexible.Even though the goal was to make it testable,the result we ended up withis actually a very idiomatic and reusable piece of Rust code.That’s awesome!

Splitting your code into library and binary targets

We can do one more thing here.So far we’ve put everything we wrote into the src/main.rs file.This means our current project produces a single binary.But we can also make our code available as a library, like this:

Put the find_matches function into a new src/lib.rs.
Add a pub in front of the fn (so it’s pub fn find_matches)to make it something that users of our library can access.
Remove find_matches from src/main.rs.
In the fn main, prepend the call to find_matches with grrs::,so it’s now grrs::find_matches(…).This means it uses the function from the library we just wrote!The way Rust deals with projects is quite flexibleand it’s a good idea to think aboutwhat to put into the library part of your crate early on.You can for example think about writing a libraryfor your application-specific logic firstand then use it in your CLI just like any other library.Or, if your project has multiple binaries,you can put the common functionality into the library part of that crate.

Note:Speaking of putting everything into a src/main.rs:If we continue to do that,it’ll become difficult to read.The module system can help you structure and organize your code.

Testing CLI applications by running them

Thus far, we’ve gone out of our wayto test the business logic of our application,which turned out to be the find_matches function.This is very valuableand is a great first steptowards a well-tested code base.(Usually, these kinds of tests are called “unit tests”.)

There is a lot of code we aren’t testing, though:Everything that we wrote to deal with the outside world!Imagine you wrote the main function,but accidentally left in a hard-coded stringinstead of using the argument of the user-supplied path.We should write tests for that, too!(This level of testing is often called“integration testing”, or “system testing”.)

At its core,we are still writing functionsand annotating them with #[test].It’s just a matter of what we do inside these functions.For example, we’ll want to use the main binary of our project,and run it like a regular program.We will also put these tests into a new file in a new directory:tests/cli.rs.

Aside:By convention,cargo will look for integration tests in the tests/ directory.Similarly,it will look for benchmarks in benches/,and examples in examples/.These conventions also extend to your main source code:libraries have a src/lib.rs file,the main binary is src/main.rs,or, if there are multiple binaries,cargo expects them to be in src/bin/<name>.rs.Following these conventions will make your code base more discoverableby people used to reading Rust code.

To recall,grrs is a small tool that searches for a string in a file.We have previously tested that we can find a match.Let’s think about what other functionality we can test.

Here is what I came up with.

What happens when the file doesn’t exist?
What is the output when there is no match?
Does our program exit with an error when we forget one (or both) arguments?

These are all valid test cases.Additionally,we should also include one test casefor the “happy path”,i.e., we found at least one matchand we print it.

To make these kinds of tests easier,we’re going to use the assert_cmd crate.It has a bunch of neat helpersthat allow us to run our main binaryand see how it behaves.Further,we’ll also add the predicates cratewhich helps us write assertionsthat assert_cmd can test against(and that have great error messages).We’ll add those dependencies not to the main list,but to a “dev dependencies” section in our Cargo.toml.They are only required when developing the crate,not when using it.

[dev-dependencies]
assert_cmd = "0.10"
predicates = "1"

This sounds like a lot of setup.Nevertheless –let’s dive right inand create our tests/cli.rs file:

use std::process::Command;  // Run programs
use assert_cmd::prelude::*; // Add methods on commands
use predicates::prelude::*; // Used for writing assertions
#[test]
fn file_doesnt_exist() -> Result<(), Box<std::error::Error>> {
    let mut cmd = Command::cargo_bin("grrs")?;
    cmd.arg("foobar")
        .arg("test/file/doesnt/exist");
    cmd.assert()
        .failure()
        .stderr(predicate::str::contains("No such file or directory"));
    Ok(())
}

You can run this test withcargo test,just the tests we wrote above.It might take a little longer the first time,as Command::cargo_bin("grrs") needs to compile your main binary.

Generating test files

The test we’ve just seen only checks that our program writes an error messagewhen the input file doesn’t exist.That’s an important test to have,but maybe not the most important one:Let’s now test that we will actually print the matches we found in a file!

We’ll need to have a file whose content we know,so that we can know what our program should returnand check this expectation in our code.One idea might be to add a file to the project with custom contentand use that in our tests.Another would be to create temporary files in our tests.For this tutorial,we’ll have a look at the latter approach.Mainly, because it is more flexible and will also work in other cases;for example, when you are testing programs that change the files.

To create these temporary files,we’ll be using the tempfile crate.Let’s add it to the dev-dependencies in our Cargo.toml:

tempfile = "3"

Here is a new test case(that you can write below the other one)that first creates a temp file(a “named” one so we can get its path),fills it with some text,and then runs our programto see if we get the correct output.When the file goes out of scope(at the end of the function),the actual temporary file will automatically get deleted.

use tempfile::NamedTempFile;
use std::io::{self, Write};
#[test]
fn find_content_in_file() -> Result<(), Box<std::error::Error>> {
    let mut file = NamedTempFile::new()?;
    writeln!(file, "A test\nActual content\nMore content\nAnother test")?;
    let mut cmd = Command::cargo_bin("grrs")?;
    cmd.arg("test")
        .arg(file.path());
    cmd.assert()
        .success()
        .stdout(predicate::str::contains("test\nAnother test"));
    Ok(())
}

Exercise for the reader:Add integration tests for passing an empty string as pattern.Adjust the program as needed.

What to test?

While it can certainly be fun to write integration tests,it will also take some time to write them,as well as to update them when your application’s behavior changes.To make sure you use your time wisely,you should ask yourself what you should test.

In general it’s a good idea to write integration testsfor all types of behavior that a user can observe.That means that you don’t need to cover all edge cases:It usually suffices to have examples for the different typesand rely on unit tests to cover the edge cases.

It is also a good idea not to focus your tests on things you can’t actively control.It would be a bad idea to test the exact layout of —helpas it is generated for you.Instead, you might just want to check that certain elements are present.

Depending on the nature of your program,you can also try to add more testing techniques.For example,if you have extracted parts of your programand find yourself writing a lot of example cases as unit testswhile trying to come up with all the edge cases,your should look into proptest.If you have a program which consumes arbitrary files and parses them,try to write a fuzzer to find bugs in edge cases.

Aside:You can find the full, runnable source code used in this chapterin this book’s repository.