What we don't know
How do the white-space characters in the output correspond to those in the input?
- Several different correspondences are possible
- Even if we pick one correspondence, there is no guarantee that it is the right one
A good solution will allow alternatives