Normalize Unicode input to normalization form C. Most of the time input
authorBrion Vibber <brion@users.mediawiki.org>
Thu, 2 Sep 2004 07:50:04 +0000 (07:50 +0000)
committerBrion Vibber <brion@users.mediawiki.org>
Thu, 2 Sep 2004 07:50:04 +0000 (07:50 +0000)
commitb687f14131d1ffb7a038a25029f948383b08299e
treee71240962969e78097c6bde19c00a93f27be8c86
parent53e71c17027ebb7a6b070f9e4d71fd346e874b6f
Normalize Unicode input to normalization form C. Most of the time input
is already in this form and it shouldn't take very long to verify it.
There is still optimization to be done though.

Partial fix for http://bugzilla.wikipedia.org/show_bug.cgi?id=240

Will also need to verify correct UTF-8 sequences and strip characters
that are illegal in XML.

Some input may not be going through this verification yet (eg the uploaded filenames)
includes/WebRequest.php