Non-word characters shouldn't terminate tag names on the tidy side too
author Arlo Breault <abreault@wikimedia.org>
Wed, 7 Jan 2015 20:46:59 +0000 (12:46 -0800)
committer Arlo Breault <abreault@wikimedia.org>
Tue, 3 Feb 2015 20:41:55 +0000 (12:41 -0800)
commit 8e8b15afc684e65946b5f6101e74ced518d2eee6
tree 7d771e4f72e07362afdc84d4220a43618917d2ec
parent d43e51a42ca248753ef9f274e1a970e0e966af23

 * Follow-up to Iceec404f46703065bf080dd2cbfed1f88c204fa5.

 * The set of characters accepted in tag names is changed to match the
   tag open state of the HTML5 parsing spec:
   http://dev.w3.org/html5/spec-preview/tokenization.html#tag-open-state

 * Equivalent change in Parsoid: I462c336f9a00c8ccd11f3220a8738389e8ba7c7c.
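
The rule referenced above can be sketched roughly as follows. This is an
illustration only, written in Python, and is not the code in
includes/Sanitizer.php; the names TAG_NAME and tag_name are hypothetical.
It assumes the HTML5 tokenizer behaviour that a tag name starts with an
ASCII letter and continues until whitespace, "/" or ">", so characters
such as "-" or "." do not end the name.

```python
import re

# Assumption: mirrors the HTML5 "tag open" and "tag name" states.
# A tag name begins with an ASCII letter and runs until tab, LF, FF,
# space, "/" or ">". Non-word characters like "-" or "." stay in the name.
TAG_NAME = re.compile(r"<(/?)([A-Za-z][^\t\n\f />]*)")


def tag_name(text):
    """Return the tag name at the start of text, or None if no tag starts there."""
    m = TAG_NAME.match(text)
    return m.group(2) if m else None


print(tag_name("<div class='x'>"))  # the name stops at the space
print(tag_name("<div-foo>"))        # "-" does not terminate the name
print(tag_name("<3 cats"))          # "3" is not an ASCII letter, so no tag
```

Under these assumptions "<div-foo>" is read as one tag named "div-foo"
rather than as "div" followed by stray text.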

Change-Id: I69cb000538fe195dd77273da5f91697fe1e7d283
includes/Sanitizer.php
tests/parser/parserTests.txt