Author: j.schauer
Description:
while i was doing a wikipedia dump on my local machine i got segfaults for three segfaults for three specific ids in refreshLinks.php and dumpHTML.php.
as this error occurred in both scripts i assume that they are not specific to these maintenance scripts.
here is my php version:
PHP 5.2.4-2ubuntu5.1 with Suhosin-Patch 0.9.6.2 (cli) (built: May 9 2008 16:54:00)
Copyright (c) 1997-2007 The PHP Group
Zend Engine v2.2.0, Copyright (c) 1998-2007 Zend Technologies
with Xdebug v2.0.2, Copyright (c) 2002-2007, by Derick Rethans
as you can see i installed xdebug to get a trace for my errors and to increase the maximum function nesting level but the error also occurs without xdebug.
and my machine:
Linux mediawiki 2.6.24-16-generic #1 SMP Thu Apr 10 12:47:45 UTC 2008 x86_64 GNU/Linux
i run ubuntu 8.04 hardy and use the latest mediawiki from svn (r35317) but the segfaults were also there when i checked it out a week ago.
to reproduce the segfaults just import this xml dump with mwdumper:
http://download.wikimedia.org/dewiki/20080422/dewiki-20080422-pages-articles.xml.bz2
and then do:
php extensions/DumpHTML/dumpHTML.php -s <startid>
and
php maintenance/refreshLinks.php <startid>
where startid is one of the following numbers:
80377
180324
185305
you will get the segfault immediately with both scripts!
Version: 1.13.x
Severity: normal
OS: Linux
Platform: PC