catimages from compat should be moved into a separate Python package that depends on pywikibot core and a large number of other libraries.
Checklist for dependencies of catimages.py
This table has moved to https://commons.wikimedia.org/wiki/User:AbdealiJK/file-metadata/Dependencies. Please update it there!
Package name | PyPI package | Ubuntu package | Py2.6 | Py3.5 | CI |
numpy | yes | yes | yes | yes | |
scipy | yes | yes | yes | yes | |
cv | deprecated and replaced by cv2 | ||||
cv2 | yes for py2 | yes | yes | yes | |
pyexiv2 | deprecated in favor of gexiv2, but preferably use exiftool as it is more complete | ||||
gi (new dep) | yes (py2, py3) | yes | yes | yes (gnome-continuous) | |
gtk | Only used to find intersection of Rectangles. Preferably remove this dep? | yes (py2, py3) | yes | yes | yes (gnome-continuous) |
rsvg | yes (py2, py3) | yes | yes | yes (gnome-continuous) | |
cairo | yes (py2, py3) | yes | yes | yes (gnome-continuous) | |
magic | yes | yes | yes | yes | |
jseg | This is currently a zip file. PyPI package needed | ||||
jseg/jpeg-6b | Can Pillow be used instead? jpeg-6b seems to be a zip file | ||||
_music21 | Do we still need the patch? If not, use PyPI | yes | |||
opencv (own) | Do we still need this? (haarcascade) | ||||
pydmtx | yes, used for QR codes. Probably use OpenCV instead? | ||||
py_w3c | yes, but use requests/bs4 instead? | ||||
_zbar | Use a new library for barcodes. Has not been updated since 2010 | ||||
_bob | Do we still need the patch? If not, use PyPI | yes | |||
xbob_flandmark | yes, but flandmark is deprecated in favor of clandmark | no | |||
bob.ip.flandmark | yes, a newer xbob.flandmark, but deprecated in favor of clandmark | yes | |||
py_flandmark | no. Incomplete Python bindings are installed with clandmark | no | |||
pywt | yes | yes | |||
slic | This is currently a zip file. Use vlfeat instead? | ||||
vlfeat | yes. Probably use this instead of clandmark and slic. | yes | yes | yes | |
yaafelib | No PyPI package. Maybe use librosa as an alternative, as it is on PyPI? | yes | |||
matplotlib | yes | yes | yes | yes | |
pycolorname | yes | yes | yes | yes | |
Pillow | yes | yes | yes | yes | |
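The gtk row above notes that the library is only used to intersect rectangles. A minimal sketch of a pure-Python replacement follows, assuming rectangles are axis-aligned and described by a top-left corner plus width and height. The names `Rect` and `intersect` are hypothetical and not part of catimages, and returning `None` for non-overlapping rectangles is a design choice of this sketch, so existing callers might need adapting.

```python
from collections import namedtuple

# Hypothetical helper type: top-left corner (x, y) plus width and height.
Rect = namedtuple("Rect", "x y width height")


def intersect(a, b):
    """Return the overlapping Rect of a and b, or None if they do not overlap."""
    left = max(a.x, b.x)
    top = max(a.y, b.y)
    right = min(a.x + a.width, b.x + b.width)
    bottom = min(a.y + a.height, b.y + b.height)
    # Rectangles that only touch along an edge are treated as non-overlapping.
    if right <= left or bottom <= top:
        return None
    return Rect(left, top, right - left, bottom - top)


if __name__ == "__main__":
    print(intersect(Rect(0, 0, 10, 10), Rect(5, 5, 10, 10)))
```

Because this only needs `collections.namedtuple`, it should run on both Python 2 and Python 3 without any extra dependency.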
Binary tools:
- exiftool
- convert (imagemagick)
- pdftotext
- pdfimages
- ffprobe
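To see which of the dependencies listed above are present on a given machine, a small check along the following lines might help. This is only a sketch: the module list is an illustrative subset of the checklist using the usual import names (Pillow imports as `PIL`), the function names are hypothetical, and `shutil.which` requires Python 3.3 or later.

```python
import importlib.util
import shutil

# Illustrative subset of the checklist; adjust as the dependency list changes.
MODULES = ["numpy", "scipy", "cv2", "magic", "pywt", "matplotlib", "PIL"]
# Executable names of the binary tools (exiftool is the ExifTool executable).
BINARIES = ["exiftool", "convert", "pdftotext", "pdfimages", "ffprobe"]


def missing_modules(names):
    """Return the top-level module names that cannot be found, without importing them."""
    return [name for name in names if importlib.util.find_spec(name) is None]


def missing_binaries(names):
    """Return the executable names that are not found on PATH."""
    return [name for name in names if shutil.which(name) is None]


if __name__ == "__main__":
    print("Missing Python modules:", missing_modules(MODULES))
    print("Missing binary tools:", missing_binaries(BINARIES))
```

Using `find_spec` instead of a real import keeps the check cheap and avoids side effects from heavy libraries, at the cost of not detecting packages that are installed but broken.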
List of analyses performed by catimages (categories populated and metadata analyzed): https://etherpad.wikimedia.org/p/Zl7V7KuK7J
Details:
Primary mentor: @DrTrigon
Co-mentor: @jayvdb
Other mentors: (optional, Phabricator username)
Skills: Python and computer vision
Estimated project time for a senior contributor: 2-4 weeks
Microtasks: T76211 T128946 T67192
Conpherence: Z360, Z441
Meetings: T133762