XML files now valid and UTF-8
- To: tvdevel@xxxxxxxx
- Subject: XML files now valid and UTF-8
- From: Andrew Flegg <andrew@xxxxxxxx>
- Date: Wed, 6 Aug 2003 15:14:51 +0100
- User-agent: Mutt/1.3.28i
Hi,
The XML files which caused a problem today, and all those generated in
future (i.e. day 6 tomorrow), will now only have the standard XML special
characters encoded as entities (e.g. <, >, & and "). All other stuff (any
non-ASCII character) is now stored as UTF-8. This means that today's
five.xml now contains a 2-byte wide character instead of an entity
reference.
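For anyone wanting to check their own handling, here is a rough sketch of the encoding rule described above, written in Python rather than whatever the generator really uses. The function name encode_text and the sample string are made up for illustration; only the four standard characters are escaped, and everything else is left alone and written out as UTF-8.

```python
from xml.sax.saxutils import escape


def encode_text(text: str) -> bytes:
    """Escape only the standard XML special characters, then encode as UTF-8.

    Hypothetical helper for illustration, not the actual generator code.
    """
    # escape() handles &, < and > itself; the double quote is added explicitly.
    escaped = escape(text, {'"': "&quot;"})
    # Non-ASCII characters are left untouched and become multi-byte UTF-8.
    return escaped.encode("utf-8")


# Made-up sample text: one non-ASCII character plus all four special characters.
sample = 'Caf\u00e9 <live> & "more"'
encoded = encode_text(sample)
```

In this sample the e-acute ends up as the two bytes 0xC3 0xA9 in the output, which is the kind of 2-byte character readers of the files should now expect.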
On the user-interface side of things Perl is automagically translating
these to ISO-8859-1 for me, but others may need to check their code and
be careful.
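If your code does not do this translation for you, something along the following lines may help. It is only a sketch in Python under my own assumptions: the helper name utf8_to_latin1 is hypothetical, and falling back to numeric character references for anything ISO-8859-1 cannot represent is one possible choice, not something the files require.

```python
def utf8_to_latin1(data: bytes) -> bytes:
    """Convert UTF-8 bytes to ISO-8859-1 bytes.

    Hypothetical helper for illustration. Characters that do not exist in
    ISO-8859-1 are replaced with XML numeric character references
    (an assumed fallback policy).
    """
    text = data.decode("utf-8")
    return text.encode("iso-8859-1", errors="xmlcharrefreplace")


# A 2-byte UTF-8 sequence becomes a single ISO-8859-1 byte.
latin1 = utf8_to_latin1(b"Caf\xc3\xa9")
```

Decoding strictly as UTF-8 first means malformed input raises an error rather than being silently mangled, which seems the safer default while everyone checks their code.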
It does mean, however, that all the pre-parsing steps necessary for
MSXML etc. are no longer needed.
Cheers,
Andrew
--
Andrew Flegg -- mailto:andrew@xxxxxxxx | http://www.bleb.org/