T106578: Update Sanitizer to match legal HTML5 character entities.
authorC. Scott Ananian <cscott@cscott.net>
Wed, 22 Jul 2015 20:07:27 +0000 (15:07 -0500)
committerTim Starling <tstarling@wikimedia.org>
Tue, 18 Aug 2015 23:05:10 +0000 (23:05 +0000)
commitbc75784cbb6d75a244c1d28dd99ac34baf930fdb
tree2024f6c01ed0047e54274583e0e1b4c87eb47577
parent87eebf8dd5ec4564aa1cfca4fe7e53fbd29da3d5
T106578: Update Sanitizer to match legal HTML5 character entities.

Invalid HTML5 character entities become instances of UTF8_REPLACEMENT,
so we also ensure that checkCSS notices this and emits the proper
human-friendly sanitization notice.

Change-Id: I76cef7c772b1e3eba0af8dab6403e9100beab03a
includes/Sanitizer.php
tests/parser/parserTests.txt