Page MenuHomePhabricator

Update plural data to CLDR version 26
Closed, ResolvedPublic

Description

CLDR version 25 is available now(http://unicode-inc.blogspot.in/2014/03/cldr-version-25-released.html).

We need to update the plural rule data to new version.


Version: unspecified
Severity: enhancement
See Also:
https://bugzilla.wikimedia.org/show_bug.cgi?id=71899

Details

Reference
bz62861

Event Timeline

bzimport raised the priority of this task from to High.Nov 22 2014, 3:05 AM
bzimport set Reference to bz62861.
bzimport added a subscriber: Unknown Object (MLST).

Change 121025 had a related patch set uploaded by Nikerabbit:
Update plurals.xml to CLDR 25

https://gerrit.wikimedia.org/r/121025

(In reply to Gerrit Notification Bot from comment #1)

Change 121025 had a related patch set uploaded by Nikerabbit:
Update plurals.xml to CLDR 25

https://gerrit.wikimedia.org/r/121025

Still pending. Are we just going to skip CLDR 25? :-) CLDR 26 comes out "soon".

(In reply to Nemo from comment #2)

Still pending. Are we just going to skip CLDR 25? :-) CLDR 26 comes out
"soon".

It is quite possible that we have to skip CLDR 25 since LE team does not have capacity to address this right now.

While we are struggling to find time to work on this, CLDR 26 released http://cldr.unicode.org/index/downloads/cldr-26

Repurposing the bug for CLDR 26.

(In reply to Santhosh Thottingal from comment #5)

While we are struggling to find time to work on this, CLDR 26 released
http://cldr.unicode.org/index/downloads/cldr-26

Which says:

Plurals

Expanded the number of locales with plural ranges to 73 (af, am, ar, az, bg, bn, bs, ca, cs, cy, da, de, el, en, es, et, eu, fa, fi, fil, fr, gl, gu, he, hi, hr, hu, hy, id, is, it, ja, ka, kk, km, kn, ko, ky, lo, lt, lv, mk, ml, mn, mr, ms, my, nb, ne, nl, pa, pl, pt, ro, ru, si, sk, sl, sq, sr, sv, sw, ta, te, th, tr, ug, uk, ur, uz, vi, zh, zu)
Added ordinals for 3 locales  [bs, dsb, hsb], and cardinals to 2 locales [dsb, hsb]. Modifications for 2 locales.

https://gerrit.wikimedia.org/r/#/c/161920/ is the new patch for this based on CLDR 26

Since the Russian plural rules changes are breaking change, we need to plan migration of Russian localization by coordinating with translatewiki.net and translators.

(In reply to Santhosh Thottingal from comment #7)

https://gerrit.wikimedia.org/r/#/c/161920/ is the new patch for this based
on CLDR 26

Can we have a readable diff as Niklas had made for I64eb97cd (rMW47f4536ca988) or is that harder now?

Since the Russian plural rules changes are breaking change, we need to plan
migration of Russian localization by coordinating with translatewiki.net and
translators.

The message sent for CLDR 24 Russian rules was
https://translatewiki.net/wiki/Thread:Portal_talk:Ru/Plural_changes_in_many_languages

So in practice this means that MediaWiki used to have one-few-many, swapped to one-many-few with CLDR 24, and now goes back to one-few-many again, with the addition of a fourth form for... numbers with decimals?

http://unicode.org/cldr/trac/ticket/6932 doesn't shed much light, it's only said that removing one of the four rules was a mistake in CLDR 24.

Change 121025 abandoned by Nikerabbit:
Update plurals.xml to CLDR 25

https://gerrit.wikimedia.org/r/121025

Change 166959 had a related patch set uploaded by Nikerabbit:
Add a script to process CLDR plural rule changes

https://gerrit.wikimedia.org/r/166959

(In reply to Gerrit Notification Bot from comment #11)

Change 166959 had a related patch set uploaded by Nikerabbit:
Add a script to process CLDR plural rule changes

https://gerrit.wikimedia.org/r/166959

Heh https://dev.translatewiki.net/w/i.php?title=MediaWiki:Historywarning/prg&diff=prev&oldid=5022292 . For messages which were not touched since they were last fuzzied, would it be possible to just restore the revision before CLDR 24? Translators would certainly see it in a jolly way if we could describe it as a cleanup of the CLDR 24 mess rather than a new episode.

Nemo: there is no such thing for prg or tl. For ru and others we can do the change automatically.

(In reply to Niklas Laxström from comment #13)

Nemo: there is no such thing for prg or tl.

Ok.

For ru and others we can do the
change automatically.

Won't they be fuzzied?

(In reply to Nemo from comment #14)

Won't they be fuzzied?

No. Please see the commit message of https://gerrit.wikimedia.org/r/166959

Change 168255 had a related patch set uploaded by Nemo bis:
Disable l10nupdate for the duration of CLDR 26 plural migration

https://gerrit.wikimedia.org/r/168255

We are currently planning to deploy this to WMF sites during 2014-10-27 – 2014-11-06. L10nupdate will be disabled during that period. I'll try to get this into next tech news. We are planning to send announcements to translatewiki.net and some email lists on Monday to let people know about l10nupdate and requesting translators to check and update translations. We will also be running scripts at translatewiki.net on Monday or Tuesday to automate the conversion where possible.

Change 161920 had a related patch set uploaded by Nikerabbit:
Update plural data to CLDR 26

https://gerrit.wikimedia.org/r/161920

Change 168255 merged by Alexandros Kosiaris:
Disable l10nupdate for the duration of CLDR 26 plural migration

https://gerrit.wikimedia.org/r/168255

Change 161920 merged by jenkins-bot:
Update plural data to CLDR 26

https://gerrit.wikimedia.org/r/161920

I'll leave this open until LU is re-enabled.

(In reply to Niklas Laxström from comment #21)

I'll leave this open until LU is re-enabled.

Which was done and not reverted by now. https://gerrit.wikimedia.org/r/171516

Change 166959 abandoned by Nikerabbit:
Add a script to process CLDR plural rule changes

Reason:
One time use script. Can be restored if we have similar needs in the future.

https://gerrit.wikimedia.org/r/166959