Page MenuHomePhabricator

import script is broken (or dump corrupted?)
Closed, ResolvedPublic

Description

Author: elwp

Description:
I tried to import the dump of the Dutch Wikipedia (with complete
page histories) and got this error after some time:

[...]
4600 (1.6539432896511 pages/sec 74.894507237181 revs/sec)
Content-type: text/html
X-Powered-By: PHP/4.3.11

<br />
<b>Fatal error</b>: Call to a member function on a non-object in
<b>/wiki/software/mediawiki/phase3-cvs260705/maintenance/importDump.php</b> on
line <b>47</b><br />


Version: 1.5.x
Severity: normal
OS: Linux
Platform: PC

Details

Reference
bz2979

Event Timeline

bzimport raised the priority of this task from to Medium.Nov 21 2014, 8:42 PM
bzimport set Reference to bz2979.
bzimport added a subscriber: Unknown Object (MLST).

elwp wrote:

This will most likely prevent the script from crashing, but are you sure no
revisions or entire pages will be lost then? When handleRevision() is called,
this indicates IMO that the script has found a revision, and that it hasn't been
parsed correctly if getTitle() doesn't return a valid title object.

Possibly due to invalid title, long title, or funny configuration.

As of r20828, invalid titles should be gracefully skipped over in an import;
a warning will be printed to stdout (w/ importDump.php) or web output
(Special:Import)