Problem solve Get help with specific problems with your technologies, process and projects.

Sax Parsers

According to XML specifications, character range #xF900 to #xFFFE is not valid. But I tried to use some characters in this range and my SAXParser did not throw any error (I'm using SAX 2.0 - Xerces 3.2.1- XML4J). Does the SAX Parser not follow all the specifications or is it a bug in the parser?
Implementing restrictions on Unicode character ranges is one area where parsers differ a lot - especially in the area of characters that are permitted within tag and attribute names.

Best to check the documentation for your parser.

Note also that it depends how you encoded the characters in your XML. If you use character entity references for example (豈) these are fine in PCDATA. Other characters can be valid in one encoding (such as ISO-8859-1) supported by many parsers, but illegal in other encodings i.e. US-ASCII.

This was last published in July 2002

Dig Deeper on Development implications of microservices architecture



Find more PRO+ content and other member only offers, here.

Have a question for an expert?

Please add a title for your question

Get answers from a TechTarget expert on whatever's puzzling you.

You will be able to add details on the next page.

Start the conversation

Send me notifications when other members comment.

By submitting you agree to receive email from TechTarget and its partners. If you reside outside of the United States, you consent to having your personal data transferred to and processed in the United States. Privacy

Please create a username to comment.