3

I get an exception The entity “nbsp” was referenced, but not declared when parsing valid HTML that contains the &nbsp entity (which makes it invalid XML; I do not control the server) while unmarshalling a HttpEntity into a NodeSeq with spray.httpx.unmarshalling.BasicUnmarshallers.NodeSeqUnmarshaller.

I can probably preprocess the HTML to remove &nbsp, but what is the accepted method for parsing HTML (with &nbsp) with Spray?

kliew
  • 3,073
  • 1
  • 14
  • 25

1 Answers1

2

You might try to write a Custom Unmarshaller that wraps JSoup.

Brian Kent
  • 3,754
  • 1
  • 26
  • 31