
Parsoid: Things that look like <pre> tags (e.g. <President>) are treated as such
Closed, Resolved (Public)

Description

$ (echo ':Barack Obama <President> of the United States'; echo ':Joe Biden') | node js/tests/parse.js

WARNING: DSR inconsistency: cs/s mismatch for node: PRE s: 25; cs: 19
<body data-parsoid='{"dsr":[0,58,0,0]}'><dl data-parsoid='{"dsr":[0,58,0,0]}'><dd data-parsoid='{"dsr":[0,58,1,0]}'>Barack Obama <pre data-parsoid='{"stx":"html","a":{"sident":null},"sa":{"sident":""},"dsr":[14,58,11,6]}'> of the United States :Joe Biden </pre></dd></dl></body>

The expected result would be something like <dl><dd>Barack Obama &lt;President&gt; of the United States</dd><dd>Joe Biden</dd></dl>. There should be no <pre> element, and the colon before "Joe Biden" should be parsed as list markup rather than appear as literal text.
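
As a reading aid for the offsets in the output above, here is a small sketch that recomputes them from the input. It assumes a dsr entry is laid out as [start, end, open-tag width, close-tag width], which this ticket does not state. Under that assumption, the pre node's entry [14,58,11,6] starts at the "<" of <President>, counts the whole 11-character <President> token as the opening tag, and runs to the end of the input, swallowing the second line.

```javascript
// Wikitext produced by the two echo commands (each echo appends a newline).
const src = ':Barack Obama <President> of the United States\n:Joe Biden\n';

// Assumed layout (not confirmed by the ticket): [start, end, openWidth, closeWidth].
const preDsr = [14, 58, 11, 6];

const start = src.indexOf('<President>');   // offset of the "<"
const openWidth = '<President>'.length;     // width of the token read as <pre ...>
const afterOpen = start + openWidth;        // first offset after the opening tag

console.log({ length: src.length, start, openWidth, afterOpen });
```

The value of afterOpen equals the "s: 25" figure in the warning. The ticket does not say how the conflicting "cs: 19" figure is derived.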


Version: unspecified
Severity: normal
See Also:
https://bugzilla.wikimedia.org/show_bug.cgi?id=57649

Details

Reference
bz57650

Event Timeline

bzimport raised the priority of this task to High. (Nov 22 2014, 2:24 AM)
bzimport added a project: Parsoid.
bzimport set Reference to bz57650.

Wow. Seems to be triggered by the unknown <President> tag. Could this be related to the recent pre work?

Change 99031 had a related patch set uploaded by Arlolra:
Attributes, if present, should be preceded by a space

https://gerrit.wikimedia.org/r/99031

Change 99031 merged by jenkins-bot:
Attributes, if present, should be preceded by a space

https://gerrit.wikimedia.org/r/99031

Change 101333 had a related patch set uploaded by GWicke:
Attributes, if present, should be preceded by a space

https://gerrit.wikimedia.org/r/101333

Change 101333 merged by GWicke:
Attributes, if present, should be preceded by a space

https://gerrit.wikimedia.org/r/101333
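
The patch title, "Attributes, if present, should be preceded by a space", together with the "a":{"sident":null} entry in the output, suggests that <President> was tokenized as the tag name "pre" followed by an attribute named "sident". The sketch below illustrates that failure mode and the rule from the patch title. The two regular expressions and the matchPreOpen helper are hypothetical and written only for this illustration. They are not Parsoid's tokenizer code.

```javascript
// Loose rule: whatever follows "<pre" up to ">" is taken as attribute text.
const loosePreOpen = /^<pre([^>]*)>/i;

// Strict rule: attribute text, if present, must start with whitespace.
const strictPreOpen = /^<pre(\s[^>]*)?\s*\/?>/i;

// Hypothetical helper: returns the attribute text if `tag` opens a <pre>, else null.
function matchPreOpen(tag, rule) {
  const m = rule.exec(tag);
  return m ? (m[1] || '').trim() : null;
}

console.log(matchPreOpen('<President>', loosePreOpen));      // 'sident'
console.log(matchPreOpen('<President>', strictPreOpen));     // null
console.log(matchPreOpen('<pre class="x">', strictPreOpen)); // 'class="x"'
```

Under the strict rule, <President> is no longer recognized as a pre tag, so in this sketch it would fall through to whatever handles unknown tags, while real <pre> tags with or without attributes still match.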