Page MenuHomePhabricator

U+3000 IDEOGRAPHIC SPACE should terminate external link
Closed, ResolvedPublic

Description

In the above URL, we see how I had to remove the

$ unicode U+3000
U+3000 IDEOGRAPHIC SPACE
UTF-8: e3 80 80 UTF-16BE: 3000 Decimal:  
 
Category: Zs (Separator, Space)
Bidi: WS (Whitespace)
Decomposition: <wide> 0020

from

交通部有做總表:http://www.highwaybus.nat.gov.tw/work/permission_report.htm 

lest it get appended into the link, and not treated just like the ASCII
space next to it. Shouldn't the parser treat both types of spaces the
same in this situation? Can Asian users reasonably be expected to always
remember to terminate in ASCII spaces?
I wonder how Bugzilla will treat that line, 5 lines above this. See also bug 1414.


Version: 1.16.x
Severity: enhancement
URL: http://taizhongbus.jidanni.org/index.php?title=%E4%B8%AD%E5%85%AC%E8%A8%8E%E8%AB%96:%E5%85%A8%E5%9C%8B%E5%85%AC%E8%B7%AF%E8%88%87%E5%9C%8B%E9%81%93%E5%AE%A2%E9%81%8B%E8%B7%AF%E7%B7%9A%E7%B7%A8%E8%99%9F%E5%B0%8D%E7%85%A7&diff=5173&oldid=5164

Details

Reference
bz19052

Event Timeline

bzimport raised the priority of this task from to Medium.Nov 21 2014, 10:40 PM
bzimport set Reference to bz19052.
  • Bug 25409 has been marked as a duplicate of this bug. ***

Updated to cover all unicode characters in the 'separator, space' category.
See r93291 which is pending review.

This was reviewed and should be deployed with 1.19