Caoilte O'Connor

Bending JAXB to the will of Scala (Part 2 of 2)

2014-10-06T00:00:00+00:00

This blog post is the second in a two part series that bottoms out solutions for the corner cases that I ran into whilst getting JAXB to work in Scala. I am indebted to Martin Krasser’s blog post on JAXB for providing an essential first primer on the topic.

This week I enumerate the difficulties with lists and the work arounds I found for making them manageable. You can still read the first part of the series (on optionals) here.

Lists of the reasons to consider giving up on JAXB

Lists in JAXB are where the gaps between Java and Scala start to become painful. They’re also where I gave up for a week in disgust and found solace in signing up for Erik Meijer’s new Haskell MOOC. I eventually succeeded with two different ways of modeling lists, only the second of which turned out suitable for implementing the Atom specification.

Containerized Lists

Martin Krasser’s JAXB blog post doesn’t explain how to model lists, but does link to extensive and very helpful examples in his event sourcing project.

With a little bit of tweaking I managed to get them working for me,

This code will turn,

Entry("101", 
    List(
        Person("Caoilte O'Connor"),
        Person("Martin Krasser")
    )
)

into,

<?xml version="1.0" encoding="UTF-8"?>
<entry>
    <id>101</id>
    <authors>
        <person>
            <name>Caoilte O'Connor</name>
        </person>
        <person>
            <name>Martin Krasser</name>
        </person>
    </authors>
</entry>

The most notable property of the xml is the nesting of list elements within an authors element. A lot goes on in order to achieve this containerization, so let’s break it down into a list of bullets,

JAXB doesn’t know how to handle Scala Lists, but it can map Java Lists and so the AbstractListAdapter turns a Scala List into a container of a Java List and vice versa.
Unfortunately one XmlAdapter cannot encapsulate the entire abstract transformational logic as it has for previous examples. The AbstractList trait describes the properties that the container requires.
The Person companion object then implements all of the abstract types described previously. This code is entirely boiler plate and seems like an excellent opportunity to re-watch Dave Gurnell’s Macros for the Rest of Us talk.
Finally, I have included the Person Case Class and an Entry Case class that incorporates a list of Persons.
The only change that I made to Martin Krasser’s code was because of the MOXy bug mentioned in the previous section, which required me to reverse the order of the type parameters on AbstractListAdapter.

Unfortunately, despite how clever it is, this code isn’t suitable for the Atom Feed use case because that requires XML to be produced without the container element and this requires a completely different approach which I will discuss in the next section.

Mapping Scala Collections to Java Collections for Containerless Lists

Unfortunately the XmlAdapter can only be used to model the transformation for elements in a list. You cannot use it to model the transformation of the list itself. We got around this in the previous section by transforming a Scala List (which JAXB doesn’t recognise at all) into a container of lists. If you try and turn the Scala List into a Java List, JAXB falls over hard. (It recognises that the property will end up as a Java List but ignores the adapter and attempts to cast the Scala List to a Java Collection directly.)

Because of this limitation the entire approach needs to be re-thought. I tried utilizing other JAXB annotations (eg @XmlElementWrapper, @XmlPath and @XmlTransformation) but couldn’t get any of them to work with Scala. I had a similar lack of success with other Scala Collection types (Seq, ArraySeq and WrappedArray). JAXB managed to marshall some to XML but was unable to unmarshall any back to types.

The only type which worked was the Scala Array. This is probably because Array has some unique (and to be honest, rather awkward) properties. Unlike most (perhaps all?) other Scala types, Array compiles down to a Java Array in byte code and is indistinguishable to Java code. This has pros and cons,

On the plus side,

JAXB knows how to map a Java Array to xml and back again
Scala makes all of its Collection mapping goodies available to an Array
Arrays are more performant for very large data sets (I’m clutching at straws here)

On the minus side,

Arrays are mutable. Case Classes promote the use of immutable data because it is easier to reason with and it is a shame to be forced to give that up.
Because Arrays are really from Java, Scala cannot override the equals, hashCode and toString methods. This means they are pretty much the only type in Scala to compare with reference equality rather than value equality. Paul Phillips gives a great explanation here about why that’s a good thing, but it still hurts and causes all sorts of problems using them in Case Classes (as we shall see).

The comparison with reference equality can be worked around. There’s a great exploration of the possibilities here. Making Arrays work in a Case Class context is trickier, because a Case Class is really just a boilerplate generator and the boilerplate generation makes no special exception for Arrays and the Java equals, hashCode and toString methods they expose.

I spent some time figuring out how to work around this. At first I got very excited to discover that Case Classes support two parameter lists and that only parameters in the first parameter section are considered for equality and hashing. (HT: Jason Zaugg, again.) However, there is no way to partially extend equals/toString/hashCode and so this didn’t gain me anything as I still had to override everything. A better solution presented itself after I saw this this manual reimplementation of a case class on a Stack Overflow question about overriding the equals method on a case class. My eventual (and I believe, optimal) solution borrowed several tricks from this discussion,

This code will turn,

Entry("101", 
    Array(
        Person("Caoilte O'Connor"),
        Person("Martin Krasser")
    )
)

into,

<?xml version="1.0" encoding="UTF-8"?>
<entry>
    <id>101</id>
    <author>
        <name>Caoilte O'Connor</name>
    </author>
    <author>
        <name>Martin Krasser</name>
    </author>
</entry>

Let’s consider what is going on,

One of the lesser known features of a Case Class is that the Scala compiler auto-generates it an implementation of the Product Trait. The Product Trait provides an interface for programmatically accessing the attribute values of a class. The Scala Compiler then auto-generates toString and hashCode methods that rely on the Product Trait. This gives us a code seam where we can interpose our own implementation of the Product trait for a Case Class, whilst still benefitting from other auto-generated goodness.
The only difference between our implementation of Product and the auto-generated implementation is that we override Arrays returned by the productElement method in a Wrapped Array. The Wrapped Array is identical to a normal Array except that it overrides toString, hashCode and equals with more useful implementations.
The manual reimplementation of a case class showed equals being implemented by the ScalaRuntime._equals method, but this is not actually the case with real case classes auto-generated by the Scala Compiler. I asked why on the scala-user list and was relieved to learn that it was a performance optimisation. Consequently, there is nothing to stop us from overriding equals with this ourselves.

It is worth re-considering the pros and cons at this point. On the plus side,

We’ve succeeded in defining a Collection containing Case Class that JAXB can marshall to XML and back again into the same Case Class.
The Case Class we have defined doesn’t break any of the conventions that the Case Class provides. It still has the same constructor, pattern-matching, equality and, product attribute properties expected.
The hand rolled boiler plate is easy to understand and relatively simple to spot errors in.
With a little work it might be possible to define a macro to do all of the auto-generation for us.

On the minus side,

With larger Case Classes the boiler plate becomes overwhelming.
Arrays are mutable and many people are uncomfortable using mutable classes in a Case Class.
Our equals method uses the ScalaRuntime._equals helper method which is described on the Javadoc as being outside the API and subject to change or removal without notice. (In practice the method is three lines long and simple to re-implement.)

Afterthoughts

I have succeeded in bending JAXB to my use-case, but at what cost? If I use JAXB Annotated Case Classes for my domain model then I,

have to suffer the heavy miasma of annotation pollution
am constrained to the use of Arrays to describe lists and have to re-implement a large part of each case class as a result
am at the mercy of any future Scala/Java interop bugs I find in the JAXB space (consider the map)

On reflection, JAXB is not a good fit for Scala in most scenarios - especially ones where the specification is complex or could evolve. As I have nearly completely implemented the Atom spec it isn’t so bad, but I would think twice before using it again.

Bending JAXB to the will of Scala (Part 1 of 2)

2014-09-29T00:00:00+00:00

Martin Krasser has written the definitive guide to JAXB in Scala. This blog post is the first in a two part series that bottoms out solutions for the corner cases that I ran into whilst appropriating all of his hard work.

This week I’ll focus on optionals. Next week I will cover lists.

Background

The use-case that I am exploring in this series of blog posts was modelling the Atom Syndication Format in appropriately typed Scala Case Classes that serialize to the corresponding XML as cleanly and concisely as possible. My (nearly complete) implementation is available here.

I wanted to avoid scala-xml because of the (perhaps un-justified) bad press it gets. Jackson and json4s both have XML bindings, but they are not flexible enough to accurately map the complete Atom Specification. The only other well supported Scala xml library that I am aware of is scalaxb. I have used that before and it is very good. However,

It relies on XSDs and although there are Atom XSDs out there, none are official.
The Case Classes generated are not as friendly to use as hand rolled could be (as evidenced by these examples of Entries generated from different XSDs I found).
Under the hood it is still using scala-xml

Since none of the Scala libraries perfectly matched my requirements I decided (perhaps foolishly) to try falling back to the de-facto Java standard, JAXB.

Basic Marshalling

Basic types (strings, nested types, un-parameterised custom types) are very simple to map once you understand how to map annotations correctly in Scala (and as Martin Krasser explains very well in his blog post).

The Troubles with Optionals

Martin also gives a great example of binding an optional string to a case class but problems emerge when you actually use it.

None Behaviour on Strings

I expected,

Person("Martin Krasser", None, 30 /* haha */)

to serialize to

<?xml version="1.0" encoding="UTF-8" standalone="yes"?>
<person>
  <name>Martin Krasser</name>
  <age>30</age>
</person>

ie to leave out the Option[String] username field because it was set to None. However, despite running JDK7u45 (Martin thought JDK7u4 and up should contain a fix for this bug) I still got a nasty NPE. And then, even when I upgraded to the latest Reference Implementation version of JAXB (2.2.7) the case class would marshal to XML but left in an empty element for username despite the XMLAdapter specifically returning null.

<?xml version="1.0" encoding="UTF-8" standalone="yes"?>
<person>
  <name>Martin Krasser</name>
  <username/>
  <age>30</age>
</person>

An unneccessary empty <username/> element breaks my requirement for clean and concise xml.

Optional Inner Value-Only Types

More serious problems emerged when I tried to map one very specific optional inner type. Consider the following code,

It should marshal the following case class

Person("Caoilte O'Connor", None)

<?xml version="1.0" encoding="UTF-8" standalone="yes"?>
<person>
  <name>Caoilte O'Connor</name>
</person>

However, for me it NPEs when using jdk7u45 (with or without adding the RI JAXB 2.2.7 as an explicit dependency). The problem is very specific and I was only able to reproduce it for types with one attribute that was @xmlValue annotated. (I have raised the bug as JAXB-1052.)

EclipseLink MOXy : A Solution for Optionals

The only solution I found for these problems was to switch JAXB implementation (as advised by Blaise Doughan in this Stack Overflow question). EclipseLink MOXy fixes both of the problems described above.

Switching was trivial. I just added a dependency on "org.eclipse.persistence" % "org.eclipse.persistence.moxy" % "2.5.2" and put the following jaxb.properties file

javax.xml.bind.context.factory=org.eclipse.persistence.jaxb.JAXBContextFactory

under the same package as the case classes that I was marshalling, except in the resources directory.

As an added bonus, MOXy also doesn’t print the standalone attribute in the xml fragment, which was breaking my requirement for clean and concise xml and which Metro (the Reference Implementation) cannot be configured to remove. My person case class would now marshal as,

<?xml version="1.0" encoding="UTF-8"?>
<person>
  <name>Caoilte O'Connor</name>
</person>

Perfectly concise!

Mapping Other People’s Types to Optionals

The previous two examples describe the problems I found with using OptionAdapter to map a type that JAXB knows how to marshal/unmarshal. Taking a custom type that JAXB doesn’t know how to manage (ie one you have already created your own XmlAdapter for) and making it optional adds a whole new dimension to the problem space. You cannot simply pass your custom type as a type parameter to the OptionAdapter and let JAXB figure out the chain of transformations required (although I did try). I came up with the following solution,

There’s nothing new, or even particularly pleasing about this code, but it is interesting because it was the most concise way that I was able to take an existing XmlAdapter (DateTimeAdapter) and make it optional.

I have tested that the bug described in the JavaDoc of CustomOptionAdapter has been fixed in the trunk of EclipseLink MOXy.

Afterthoughts

I decided to write about JAXB in Scala because I wanted to highlight the problems that emerge when you scratch the surface with a real life use-case. That doesn’t mean I endorse the use of JAXB in Scala. I’m still very ambivalent about the library. Exactly why might become clearer in part two of this two part series when I scratch a little deeper on the problems for JAXB in Scala and talk about lists.