author | David Phillips <david@sighup.nz> | 2019-09-14 16:22:07 +1200 |
---|---|---|
committer | David Phillips <david@sighup.nz> | 2019-09-14 16:22:27 +1200 |
commit | 8f86ef32dff18c0b6499bc7d934e222990451c32 (patch) | |
tree | 478751c61c4c07c6ab9a8748468a59862acf2aa0 /test/test_men.t | |
parent | ea6db8e62753a011321da89439c650f70bd1d032 (diff) | |
download | idalius-8f86ef32dff18c0b6499bc7d934e222990451c32.tar.xz |
URL_Title: Allow entities and wchars mixed in titles
This patch defers HTML entity decoding until after the raw bytes from the
HTML document are translated through charsets. Previously, entities were used
as decoded by the HTML parser into UTF-8, which meant that non-UTF-8-encoded
strings from documents could become mixed with UTF-8 characters, making
the subsequent character encoding transformation impossible to perform
correctly.
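
The ordering problem described above can be illustrated with a minimal Python sketch. This is not the project's code: the function names, the sample title, and the hand-rolled entity replacement that simulates the old parser behaviour are all assumptions made for illustration. Only `bytes.decode` and `html.unescape` are real standard-library calls.

```python
import html


def title_entities_last(raw: bytes, charset: str) -> str:
    """Order described by the patch: charset translation first, entities second."""
    # Translate the raw document bytes into text using the document's charset.
    text = raw.decode(charset, errors="replace")
    # Only now expand HTML entities, so they land in already-decoded text.
    return html.unescape(text)


def title_entities_first(raw: bytes, charset: str) -> str:
    """Simulated old order (hypothetical): the entity becomes UTF-8 bytes
    while the rest of the title is still in the document's charset."""
    mixed = raw.replace(b"&eacute;", "\u00e9".encode("utf-8"))
    # The charset translation now misreads the UTF-8 bytes of the entity.
    return mixed.decode(charset, errors="replace")


# A Latin-1 title containing both a raw non-ASCII character and an entity.
raw_title = "Caf\u00e9 &eacute;".encode("latin-1")

print(title_entities_last(raw_title, "latin-1"))
print(title_entities_first(raw_title, "latin-1"))
```

With entities expanded last, both accented characters pass through a single, consistent charset translation. In the simulated old order, the UTF-8 bytes produced for the entity are reinterpreted as Latin-1 and come out garbled.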
Diffstat (limited to 'test/test_men.t')
0 files changed, 0 insertions, 0 deletions