Author: jymj2002
Description:
I downloaded the latest version of the spanish articles in 'xml' and the latest version of mwdumper (2008-04-13):
eswiki-20080507-pages-articles.xml.bz2
and I followed the instructions to load it in a mysql database. The exact line I type is:
java -client -classpath mwdumper.jar;mysql-connector-java-3.1.12-bin.jar org.mediawiki.dumper.Dumper "--output=mysql://127.0.0.1/wikidb?user=<user>&password=<password>" "--format=sql:1.5" "C:\eswiki-20080507-pages-articles.xml.bz2"
(where <user> and <password> are correctly especified).
Everything seems to work ok, the output I get is:
1.000 pages (249,004/sec), 1.000 revs (249,004/sec)
and similar lines starting with 2.000, 3.000... till it reaches the line starting with 17.000. At this point I get the following message:
17.000 pages (366,08/sec), 17.000 revs (366,08/sec)
Exception in thread "main" java.io.IOException: java.sql.SQLException: Duplicate entry '0-?' for key 2
(and then the typical exception stack trace).
I think maybe it could be something with the encoding of spanish accents (á, é....) or special characters such as 'ñ', so I tried creating the database with other charsets but I get the same error.
See Also: T11279: mwdumper direct MySQL connection needs to distinguish UTF-8 and compat schemas
Version: unspecified
Severity: normal
OS: Windows XP
Platform: PC