Please upload a 1000+ page manuscript at maximum legible compression for proofreading
Closed, Resolved (Public)

Description

Author: windowpain1234

Description:
Please upload this 1.9 GB, 1000+ page PDF scan of a manuscript that will be used for proofreading and transcription. It is part of a significant work that is not yet on Wikisource; see the Wikipedia article "Theophrastus redivivus" for more information on the manuscript. I have taken every measure to compress the file carefully down to this size (it was much, much larger before). In my best judgment, further compression would hurt transcription, as the text is in cursive Latin.

The file and description txt:

Thank you very much!
--Ithinkicahn


Version: unspecified
Severity: normal

Details

Reference
bz62423

Event Timeline

bzimport raised the priority of this task to Medium. (Nov 22 2014, 2:53 AM)
bzimport set Reference to bz62423.
bzimport added a subscriber: Unknown Object (MLST).

windowpain1234 wrote:

Thanks! Do you know how long it will take for the thumbnails to be generated, or until it will be viewable? It doesn't seem to be viewable at the moment, which I'm guessing is a processing delay.

(In reply to windowpain1234 from comment #2)

> Thanks! Do you know how long it will take for the thumbnails to be generated, or until it will be viewable? It doesn't seem to be viewable at the moment, which I'm guessing is a processing delay.

Not sure. The file is pretty large, so it first needs to be copied to the image scalers, and that might cause issues or timeouts. There are numerous bugs filed for these types of issues. It might be worth giving it a couple of hours to see whether the thumbnails start appearing.

Individual pages at a couple of MB should be fine.

windowpain1234 wrote:

Thanks, Sam. One last thing: while setting up the file, I noticed that I had missed an error in the scanned pages, specifically 2 duplicate pages included in the PDF. I've fixed the error on my side, so would it be alright if we replaced the file with this fixed version? Last time, promise. :)

File: https://drive.google.com/file/d/0B3oMA7P6YNsleEJqSkxQQmhSXzA

Very much appreciated

windowpain1234 wrote:

Hey guys, any reason why the file isn't loading yet? The file itself can be downloaded, but none of its pages load on the actual sites.

Which file? The original at https://upload.wikimedia.org/wikipedia/commons/1/17/Theophrastus_redivivus_%28Paris_manuscript%29.pdf ?
Or the previews? Pretty sure that the previews trigger "out of memory".

According to paravoid, the file creates issues on swift.

File has been deleted due to size causing server issues.

It doesn't need uploading again; it can just be undeleted once the issues are resolved, worked around, or prevented.

Are there specific blockers that can be set on this bug?

Aaron, since you were debugging this yesterday, could you take a stab at answering TTO's question:

(In reply to This, that and the other from comment #11)

Are there specific blockers that can be set on this bug?

Aaron, could you answer comment 12, please?

Aaron, could you answer comment 12, please?

tomasz set Security to None.

Aaron, could you answer greg's comment, please?

File has been deleted due to size causing server issues.

In T64423#652695, @TTO wrote:

Are there specific blockers that can be set on this bug?

@aaron: Do you know?

In T64423#652701, @greg wrote:

Aaron, since you were debugging this yesterday, could you take a stab at answering TTO's question:

(In reply to This, that and the other from comment #11)

Are there specific blockers that can be set on this bug?

Probably https://phabricator.wikimedia.org/T99263 and something analogous to 746c8ddc1d521f520242 in the shorter term.

Aklapper lowered the priority of this task from Medium to Low.

Is this still an issue?

No replies. Assuming this has been sorted out over the years. If not, please open a new ticket. Thanks!