Page MenuHomePhabricator

XML import and export for use within wiktionary
Closed, DeclinedPublic

Description

Author: gerardm

Description:
It would be really usefull to be able to import XML dictionary, glossary data
that are open content into wiktionary.

It would be equally usefull to be able to export XML data about wiktionary content.

This would enhance the open character of wiktionary and it would open up the
"proprietary" nature of our content.

Thanks,

GerardM

Version: unspecified
Severity: enhancement

Details

Reference
bz381

Related Objects

Event Timeline

bzimport raised the priority of this task from to Low.Nov 21 2014, 6:48 PM
bzimport set Reference to bz381.
bzimport added a subscriber: Unknown Object (MLST).

jeluf wrote:

XML is a standard for how to format data. Just like CSV or other formats, it
does not provide information about the meaning of data. "import xml" is a very
vague request.

gerardm wrote:

There are at least two existing XML definitions for things like a glossary. The
people from GEMET produce there thesaurus in XML. This would cover a lot of the
ground that is required. There propably also is a XML standard for dictionary
content. I will learn that from the GEMET people I trus.

Thanks,

GerardM

sfkeller wrote:

For thesaurus data exchange I would propose the Zthes standard which is based on some ISO
standards.

It seems to be most wiedly used including ORACLE (?) and our projects www.geometa.info and
www.integis.ch/tsaurus (see e.g. http://www.geometa.info/report/geometa-info-6-3.html).

gerardm wrote:

There are several standards there is one for thesaurus use, one for glossary use
and several for dictionary use. Important in the selection is to find that it
fits what Wikionary will require. A thesaurus typically does not include
ethymology or translations.

wikipedia wrote:

Why not just export the data in the simplest XML format we can think of that
includes all the data. If people want it in some other standard format, that's
what XSLT was designed for. Confirming to some other standard straight from
wiktionary will just make it harder to implement competing standards.

wikt.3.connelm wrote:

(cont.)

Import failed: XML import parse failure at line 184308, col 84 (byte 12778764; ""): Invalid document end

  • Served by srv136 in 5.508 secs. --

Import failed: XML import parse failure at line 190401, col 2 (byte 13262462; ""): Invalid document end

  • Served by srv125 in 6.480 secs. --

wikt.3.connelm wrote:

Whoops, sorry. That last comment was intended for bugzilla:9911.

happy.melon.wiki wrote:

Stale, no real activity for four and a half years. We already have data dumps in XML format to export content. Importing data in XML format is best done by a bot script.

  • Bug 21460 has been marked as a duplicate of this bug. ***

I'm going to close this bug again as it is too vague. Besides, in the current day, things like this should be done by bots (for import) and scripts operating on database dumps (for export).