Data driven tests

I’m not sure if anybody uses the terminology “data driven test”, but if you explain what it is, experienced people will tell you that they are bad. Data driven tests are tests that repeat the same code over many different pieces of data.

Let’s show an example. For my startup project Keep on Posting, I have a method that turns a blog url into a feed url. That method is critical for my application and there are many things that can go wrong, so I test it by querying a sample of real blogs. The test would be something like this (this is in Ruby):

[sourcecode lang=”ruby”]
require "test/unit" # assumes the test-unit gem, which provides assert_true

class BlogToolsTest < Test::Unit::TestCase
  # Sample of real blogs and the feed URL each one should resolve to.
  BLOGS_AND_FEEDS = {
    "http://blog.sandrafernandez.eu" => "http://blog.sandrafernandez.eu/feed/",
    "http://www.lejanooriente.com" => "http://www.lejanooriente.com/feed/",
    "http://pupeno.com" => "http://pupeno.com/feed/",
    "http://www.avc.com/a_vc" => "http://feeds.feedburner.com/avc",
  }

  def test_blog_to_feed_url
    BLOGS_AND_FEEDS.each_pair do |blog_url, feed_url|
      assert_true feed_url == BlogTools.blog_to_feed(blog_url)
    end
  end
end
[/sourcecode]

Note: I’m using assert_true instead of assert_equal to make a point; these kinds of tests tend to use assert_true.

The problem with that is that eventually it’ll fail and it’ll say something like:

[sourcecode]
false is not true
[/sourcecode]

Oh! so useful. Let’s see at least where the error is happening… and obviously it’ll point to this line:

[sourcecode lang=”ruby”]
assert_true feed_url == BlogTools.blog_to_feed(blog_url)
[/sourcecode]

which is almost as useless as the failure message. That’s the problem with data driven tests. You might be tempted to do this and re-run the tests:

[sourcecode lang=”ruby”]
def test_blog_to_feed_url
  BLOGS_AND_FEEDS.each_pair do |blog_url, feed_url|
    puts blog_url
    puts feed_url
    assert_true feed_url == BlogTools.blog_to_feed(blog_url)
  end
end
[/sourcecode]

but if your tests take hours to run, like the ones I often found while working at Google, then you are wasting time. Writing good error messages ahead of time helps:

[sourcecode lang=”ruby”]
def test_blog_to_feed_url
  BLOGS_AND_FEEDS.each_pair do |blog_url, feed_url|
    assert_true feed_url == BlogTools.blog_to_feed(blog_url), "#{blog_url} should have returned the feed #{feed_url}"
  end
end
[/sourcecode]

and if half your cases fail, the whole suite takes an hour to run, and you have 1000 data sets, you’ll spend hours staring at your monitor, fixing one case every now and then, because as soon as one case fails, the execution of that test is halted and the remaining cases are never checked. If you are coding in a language like Java, that’s as far as you can take it.
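To make the halting behaviour concrete, here is a minimal sketch in plain Ruby, without any test framework. `fake_blog_to_feed`, `HALTING_SAMPLE` and `checked_before_first_failure` are hypothetical names invented for this illustration, and the stand-in simply appends `/feed/` instead of querying a real blog. A raised error plays the role of a failed assertion.

```ruby
# Hypothetical stand-in for BlogTools.blog_to_feed; it only appends "/feed/".
def fake_blog_to_feed(blog_url)
  "#{blog_url}/feed/"
end

# Two sample pairs; the first one fails with the stand-in above.
HALTING_SAMPLE = {
  "http://www.avc.com/a_vc" => "http://feeds.feedburner.com/avc",
  "http://pupeno.com" => "http://pupeno.com/feed/",
}

# Returns the blog URLs that were actually checked before the first failure.
def checked_before_first_failure(data)
  checked = []
  begin
    data.each_pair do |blog_url, feed_url|
      checked << blog_url
      unless feed_url == fake_blog_to_feed(blog_url)
        raise "#{blog_url} should have returned the feed #{feed_url}"
      end
    end
  rescue RuntimeError
    # A test framework would report the failure here and abandon the method.
  end
  checked
end

p checked_before_first_failure(HALTING_SAMPLE)
```

Only the first URL is ever checked: the second pair would pass, but the loop never reaches it.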

With Ruby you can push the boundaries and write it this way (thanks to executable class bodies):

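[sourcecode lang=”ruby”]
class BlogToolsTest < Test::Unit::TestCase
  # The class body is executable, so this loop defines one test method per pair.
  BLOGS_AND_FEEDS.each_pair do |blog_url, feed_url|
    define_method "test_#{blog_url}_#{feed_url}" do
      assert_true feed_url == BlogTools.blog_to_feed(blog_url), "#{blog_url} should have returned the feed #{feed_url}"
    end
  end
end
[/sourcecode]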

That will generate one method per item of data, so even if one fails, the rest will still be executed, as they are separate, isolated tests. They may also be executed in random order, so you don’t end up with tests depending on other tests, and even if you don’t get a nice error message, you’ll know which piece of data is the problematic one from the method name.

Note: that actually doesn’t work because blog_url and feed_url contain characters that are not valid in method names; they should be replaced, but I wanted to keep the example concise.
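One possible way to do the replacement mentioned in the note is sketched below. It is only an illustration: `safe_test_name`, `GeneratedBlogChecks`, `SAMPLE_BLOGS_AND_FEEDS` and `run_generated_checks` are hypothetical names, the `BlogTools` module is a stand-in that just appends `/feed/`, and a raised error stands in for a failed assertion so the sketch runs without a test framework.

```ruby
# Hypothetical stand-in for the real BlogTools, which queries live blogs.
module BlogTools
  def self.blog_to_feed(blog_url)
    "#{blog_url}/feed/"
  end
end

SAMPLE_BLOGS_AND_FEEDS = {
  "http://pupeno.com" => "http://pupeno.com/feed/",
  "http://www.avc.com/a_vc" => "http://feeds.feedburner.com/avc",
}

# Hypothetical helper: replaces every run of characters other than
# letters and digits with a single underscore.
def safe_test_name(blog_url)
  "test_" + blog_url.gsub(/[^a-zA-Z0-9]+/, "_")
end

class GeneratedBlogChecks
  # One method per pair, named after the blog URL it checks.
  SAMPLE_BLOGS_AND_FEEDS.each_pair do |blog_url, feed_url|
    define_method(safe_test_name(blog_url)) do
      unless feed_url == BlogTools.blog_to_feed(blog_url)
        raise "#{blog_url} should have returned the feed #{feed_url}"
      end
    end
  end
end

# Runs every generated method on a fresh instance; a failure in one
# does not prevent the others from running.
def run_generated_checks
  results = {}
  GeneratedBlogChecks.instance_methods(false).sort.each do |name|
    begin
      GeneratedBlogChecks.new.public_send(name)
      results[name] = :pass
    rescue RuntimeError
      results[name] = :fail
    end
  end
  results
end

p run_generated_checks
```

With the stand-in, the pupeno.com check passes and the avc.com check fails, and the failing method name alone tells you which URL is the culprit.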

Since I’m using shoulda, my resulting code looks like this:

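[sourcecode lang=”ruby”]
class BlogToolsTest < Test::Unit::TestCase
  # shoulda's `should` defines one named test per pair.
  BLOGS_AND_FEEDS.each_pair do |blog_url, feed_url|
    should "turn blog #{blog_url} into feed #{feed_url}" do
      assert_equal feed_url, BlogTools.blog_to_feed(blog_url), "#{blog_url} did not resolve to the feed #{feed_url}"
    end
  end
end
[/sourcecode]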

and running them in RubyMine looks like this:

3 responses to “Data driven tests”

2011-03-22

Rory

“If you are coding in a language like Java, that’s as far as you can take it.”

That’s not true. You can programmatically generate test cases in JUnit3, and I understand it’s made even easier in JUnit4. It’s similarly easy in TestNG.

1. 2011-03-22
  
  J. Pablo Fernández
  
  Oh, I’d like to see some sample code for that…. or is it that thing where you have an XML file associated with a test as test source? That would work although I find it a little bit too contrived.
  
  1. 2011-03-22
    
    Rory
    
    In JUnit 3 a test case is just an instance of a subclass of TestCase. There are two ways to create them.
    
    You can define methods named like “testSomething” and have the JUnit infrastructure generate objects automatically, one for each test method. That’s the usual way to create tests that you’re familiar with, but there’s no easy way to loop through some data set and run the same test for each test datum.
    
    The other way is to define a TestSuite that you populate with TestCase instances yourself. You override the ‘void runTest(TestResult)’ method with your own test logic. So you can define a subclass of TestCase with constructor parameters for, e.g., blog URL and feed URL, and have runTest contain the test logic. Then you construct a suite containing one instance of that test class for each set of test data.
    
    I hope that makes sense. I didn’t find an example of that kind of code online, and I’m wary of trying to type it from memory into a comment text box. ;)
    
