Monadic Test Data Generator

Quicksearch

Login

Delimited Continuations vs. For-Comprehensions >

Monadic Test Data Generator

Saturday, April 30. 2011

Recently, I came across the need to generate test data for a protocol converter: a pair of functions converting one set of classes into another and back. To give you a bit more background, each set of classes represents the set of messages which can be exchanged between a service and a network client: one set which is used by the service internally, and the other set which is tied to the specific serialization protocol implementation (in this case Google protocol buffers). The protocol converter then boils down to some large pattern matches with quite boring code, hence the need for good and thorough verification.

In former times, I would have written down a smallish set of test data by hand, hoping to cover most cases. From that developed a technique where I explicitly convert some of these test instances into functions, leaving out one or two of the arguments and applying some randomly generated data to these. This approach does not scale, so inspired by the new Future.flow mechanism in the upcoming Akka 1.1 release (based on delimited continuations and monadic composition) I decided to try out something new: monadic test data generators.

The principle is quite simple: provide generators for simple types and support composition for building up complex generators. The latter is naturally supported by Scala's for-comprehensions and based upon implementing map and flatMap. The generator skeleton looks surprisingly simple:

class Generator[+T](f : () => T) extends Function0[T] {

```
  override def apply() = f()
```

  def apply(n : Int) = 0 to n map ( x => f() )

```
 
```

  def map[TT](ff : T => TT) = Generator( () => ff(f.apply) )

  def flatMap[TT](ff : T => Generator[TT]) = Generator( () => ff(f.apply).apply )

```
}
```

Luckily—though I don't really believe in coincidence—the generator can be covariant in its generated type. The first apply is the single-element generator, while the second apply(Int) generates a sequence of elements. A generator for one type can be used to construct a generator for another type by applying the conversion function to map. flatMap does the very same thing and is needed for the proper function of for-comprehensions.

The next ingredient is a seed of simple generators:

val genBool = Generator( () => Random.nextBoolean )

val genInt = Generator( () => Random.nextInt )

val genLong = Generator( () => Random.nextLong )

And now let's combine them to generate some compound type:

case class A(x : Boolean, y : Int, z : Long)

```
 
```
```
val genA = for {
```
```
    x <- genBool
```
```
    y <- genInt
```
```
    z <- genLong
```
```
  } yield A(x, y, z)
```
```
 
```
```
val someAs : IndexedSeq[A] = genA(12)
```

The for-comprehension basically binds the three arguments to A's constructor to the three generators, thereby creating a new generator. The result is then exemplarily applied 12 times to generate a sequence of twelve possible values of type A. The monadic structure enables using this pattern without limit to construct arbitrarily complex generators. And this should be compared to my previous naive approach:

val someAs = 0 to 10 map ( x => A(true, Random.nextInt, 1234L) )

val someMore = 0 to 10 map (x => A(false, 42, Random.nextLong) )

Posted by Dr. Roland Kuhn in Scala at 18:09 | Comments (2) | Trackbacks (0)

Trackbacks

Trackback specific URI for this entry

No Trackbacks

Comments

Display comments as (Linear | Threaded)

Hi,

Thanks for the nice article!

I am new to functional programming and I have the following question: why the monadic approach should be preferred to: val someAs = 0 to 10 map ( x => A(Random.nextBoolean, Random.nextInt, Random.nextLong) ) ? This one is much simpler to code/understand/maintain.

#1 Cyril on 2011-05-02 00:15 (Reply)

In the end my code leads to the execution of the same commands, this is true, and I started out exactly in the same way as you propose. But then I wanted to combine generators for A's and B's and some primitive types into a generator for C's, which suddenly produced more visual clutter than I liked, so I started thinking and remembered that composition is the one thing monads should be good at, so I tried it.

In my specific use case this approach actually did save some typing and the result looks less confusing.

#1.1 rk on 2011-05-02 09:09 (Reply)

Add Comment

Name
Email
Homepage
In reply to
Comment	You can use [geshi lang=lang_name [,ln={y\|n}]][/geshi] tags to embed source code snippets. Markdown format allowed Standard emoticons like :-) and ;-) are converted to images. E-Mail addresses will not be displayed and will only be used for E-Mail notifications. To prevent automated Bots from commentspamming, please enter the string you see in the image below in the appropriate input box. Your comment will only be submitted if the strings match. Please ensure that your browser supports and accepts cookies, or your comment cannot be verified correctly. Enter the string from the spam-prevention image above:
	Remember Information? Subscribe to this entry