So, a bit of poking around looking at the database dumps and Special:Export for wikidatawiki
For example, for Obama, we get something that starts like:
<text xml:space="preserve" bytes="8538">{"label":{"en":"Barack Obama","fr":"Barack Obama","ar":"\u0628\u0627\u0631\u0627\u0643 \u0623\u0648\u0628\u0627\u0645\u0627","ru":"\u0411\u0430\u0440\u0430\u043a \u041e\u0431\u0430\u043c\u0430","nb":"Barack Obama","it":"Barack Obama","de":"Barack Obama","be-tarask":"\u0411\u0430\u0440\u0430\u043a \u0410\u0431\u0430\u043c\u0430","nan":"Barack Obama","ca":"Barack Obama"},"description":{"en":"President of the United States of America
Full history for Q1-Q100 is currently 76.1MB. 7z turns that into 887KB
I'm going to have a poke around at some other larger exports done via shell.
I'm just wondering/thinking there might be a better way to represent this and override the backup handlers and produce a better backup format. Not high priority, but something to think about...
Version: unspecified
Severity: enhancement