Page MenuHomePhabricator

DBQ-207 Metadata of Wikimedia commons
Closed, ResolvedPublic

Description

This issue was converted from https://jira.toolserver.org/browse/DBQ-207.
Summary: Metadata of Wikimedia commons
Issue type: Task - A task that needs to be done.
Priority: Minor
Status: Done
Assignee: Hoo man <hoo@online.de>


From: karthik Sripal <karthik_sripal@yahoo.co.in>

Date: Sun, 07 Jul 2013 05:00:50

Hi,

I would like to have the below listed Metadata fields of all the images uploaded to Wikimedia commons.

File name
Camera manufacturer
Camera model
Date and time of data generation
Horizontal resolution
Vertical resolution
Full Resolution
Software used
File change date and time
Date and time of digitizing
Flash
Latitude
Longitude
Altitude
GPS time (atomic clock)
Reference for direction of image
Direction of image

Could any of you please query this data for me - I would prefer this result set data in CSV format - but open to any of the available standard formats.

Thanks!


Version: unspecified
Severity: minor

Details

Reference
bz59489

Event Timeline

bzimport raised the priority of this task from to Needs Triage.Nov 22 2014, 2:34 AM
bzimport set Reference to bz59489.

From: Hoo man <hoo@online.de>

Date: Thu, 07 Nov 2013 22:52:38

Is this still needed? If so, I'll resolve that soon.

SQL:

SELECT img_name, img_metadata FROM image;

img_metadata needs to be unserialized using PHP then, that's easy to do but will take a bit.


From: karthik Sripal <karthik_sripal@yahoo.co.in>

Date: Fri, 08 Nov 2013 00:22:53

Yes please , thank you


From: Hoo man <hoo@online.de>

Date: Sat, 09 Nov 2013 01:09:48

Hi, please notice that I didn't do any further preprocessing of the meta data, I've just dumped it as a JSON structure.

I've chose the following format (delimiter is a tab):
Filename Json with metadata

Result:
https://tools.wmflabs.org/hoo/dbq/dbq-207.gz (about 11GiB unpacked)


From: karthik Sripal <karthik_sripal@yahoo.co.in>

Date: Sat, 09 Nov 2013 01:16:21

Thank you very much, This helps ![][1]

[1]: https://jira.toolserver.org/images/icons/emoticons/smile.gif

This bug was imported as RESOLVED. The original assignee has therefore not been
set, and the original reporters/responders have not been added as CC, to
prevent bugspam.

If you re-open this bug, please consider adding these people to the CC list:
Original assignee: hoo@online.de
CC list: hoo@online.de