
Database statistics for Wikimedia sites
Closed, ResolvedPublic

Description

I think information on database size would be quite useful, such as how many [giga]bytes of text and how many [giga]bytes of media there are.

Perhaps bandwidth data too? (Daily/monthly download/upload traffic per site, i.e. per language edition and so on.) What percentage of our traffic goes to de.wiki is a question I am curious about right now. :)


Version: unspecified
Severity: enhancement

Details

Reference
bz11362

Event Timeline

bzimport raised the priority of this task to Medium. Nov 21 2014, 10:00 PM
bzimport set Reference to bz11362.

matthew.britton wrote:

Combined bandwidth data for all Wikimedia sites is already available (e.g. at http://hemlock.knams.wikimedia.org/~leon/stats/trafstats/trafficstats-weekly.png) with a breakdown by cluster, but there is no data on individual sites.

Total amount of text in all current revisions of all article pages might make a nice figure to do stuff with (post everywhere and brag about, I guess, if you're into that kind of thing).

Alexa says 57% of visitors to wikipedia.org go to en.wikipedia.org, 16% to es.wikipedia.org and 4% to de.wikipedia.org, but Alexa's figures aren't exactly renowned for their reliability.

erikzachte wrote:

Several relevant reports that break down traffic volume have been created since this bug was filed.

Page views per month per language (for all projects):
http://stats.wikimedia.org/EN/TablesPageViewsMonthly.htm

Page views and edits per month per country and per language project (Wikipedia only): http://stats.wikimedia.org/wikimedia/squids/SquidReportsCountriesLanguagesVisitsEdits.htm

Breakdown of page requests per resource, origin, destination, etc. (all projects):
http://stats.wikimedia.org/wikimedia/squids/SquidReportRequests.htm

The dump progress report provides info on the (compressed) size of raw data dumps: http://download.wikimedia.org/backup-index.html

The monthly report card provides counts per type of media on Commons:
http://stats.wikimedia.org/reportcard/ (section 'Commons files')

I'm confident that more reports on data and traffic volumes, at different aggregation levels, will follow. For now I'd like to close this open-ended suggestion, as much has been accomplished since it was filed.

[mass-moving wikistats reports from Wikimedia→Statistics to Analytics→Wikistats to have stats issues under one Bugzilla product (see bug 42088) - sorry for the bugspam!]