
Question

Large PDF 'corruption' in WebLink

asked on March 1, 2018

When users download a large PDF (i.e., around 10 MB or larger) with the download button in the WebLink document viewer, the file will not open in Adobe Reader DC (or Acrobat Standard); it is reported as 'corrupted'.  The same file downloads fine from the client or web client.  The PDF does seem to open in the built-in viewer in Edge.

I've noticed that the file size is different when I pull it from the client vs from WebLink as well.  

Anyone experiencing the same issues?  I can reproduce this with any of our larger files it seems.


Replies

replied on March 1, 2018

OK, so it doesn't appear to be size-related after all, and I'm a bit at a loss.  Perhaps it's how the original PDF was created that causes the issue?  What I can tell is that a whole bunch of content is being added to the PDF after the %%EOF marker. That content is not there when the file is exported from the thick or web client, but it is there when the file is downloaded via WebLink.  I'm not sure what kind of processing WebLink is trying to do, but it's certainly causing some sort of issue.  Microsoft Edge appears to ignore the extra content, but Acrobat does not like it.
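
For anyone who wants to check their own downloads for the same symptom, here is a minimal Python sketch that counts how many bytes follow the PDF end-of-file marker (`%%EOF`). The function name is hypothetical, and the sketch assumes the last `%%EOF` in the file is the genuine end of the document, which may not hold if the appended data happens to contain its own marker.

```python
EOF_MARKER = b"%%EOF"


def trailing_bytes_after_eof(data: bytes) -> int:
    """Count non-whitespace-trimmed bytes after the last %%EOF marker.

    Returns -1 when no marker is present at all.
    Assumption: the last %%EOF is the real end of the PDF.
    """
    idx = data.rfind(EOF_MARKER)
    if idx == -1:
        return -1
    tail = data[idx + len(EOF_MARKER):]
    # A newline or two after the marker is normal, so ignore
    # leading/trailing whitespace before counting.
    return len(tail.strip())


# Example use (the path is a placeholder):
#   from pathlib import Path
#   extra = trailing_bytes_after_eof(Path("From WebLink.pdf").read_bytes())
#   print(extra)
```

A result of 0 means nothing but whitespace follows the marker; anything larger suggests extra content of the kind described above.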

I've attached two samples, one from the thick client and one from WebLink.  Original file is the same, just renamed to identify which is which.

 

replied on March 1, 2018

Are these documents that are stored as PDFs in your repository, or are the PDFs generated from image pages?

replied on March 1, 2018

Hi Brian,

They are stored in the repository as PDFs; however, images were also generated for them, which I believe was done for OCR reasons when we had it set up.  When we download the files we want the original document, though, not the generated images.  This has always worked before.

When I take the 'From Weblink' file above, remove the extra content added to it (everything after the %%EOF marker), and resave it, the file opens in Acrobat and is identical to the 'From Client' file in every way, including file size.  So something is causing WebLink to add extra content after the %%EOF line.  If you open the file in a text editor, the marker is on line 11223.
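
The manual fix above can be sketched in a few lines of Python. This is only an illustration of the workaround, not anything WebLink provides: the function name is hypothetical, and it assumes the last `%%EOF` marker in the file is the true end of the document.

```python
EOF_MARKER = b"%%EOF"


def truncate_after_eof(data: bytes) -> bytes:
    """Return the PDF bytes up to and including the last %%EOF marker.

    One line ending directly after the marker is kept, if present.
    Assumption: the last %%EOF is the real end of the document.
    """
    idx = data.rfind(EOF_MARKER)
    if idx == -1:
        raise ValueError("no %%EOF marker found")
    end = idx + len(EOF_MARKER)
    # Preserve the original line ending that follows the marker.
    for eol in (b"\r\n", b"\n", b"\r"):
        if data.startswith(eol, end):
            end += len(eol)
            break
    return data[:end]


# Example use (paths are placeholders):
#   from pathlib import Path
#   fixed = truncate_after_eof(Path("From WebLink.pdf").read_bytes())
#   Path("From WebLink (trimmed).pdf").write_bytes(fixed)
#   print(fixed == Path("From Client.pdf").read_bytes())
```

Writing the result to a new file keeps the original download intact, and comparing the trimmed bytes against a known-good export from the client is a quick way to confirm that nothing else in the file differs.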

I'm not sure if I mentioned this, but this is only affecting SOME PDF files, not all of them.  I can't seem to figure out what the difference is, though.

replied on March 1, 2018

Yes, and what is added after the EOF marker appears to be repeated from earlier in the document.  I'll file a bug for this.
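
One rough way to check the "repeated from earlier" observation on another file is to take the first bytes of the trailing data and search for them in the body of the PDF. The Python sketch below does that; the function name and the 64-byte probe length are arbitrary choices for illustration, and it again assumes the last `%%EOF` marker is the genuine one.

```python
EOF_MARKER = b"%%EOF"


def find_tail_in_body(data: bytes, probe_len: int = 64) -> int:
    """Return the offset in the body where the trailing data also occurs.

    The "body" is everything before the last %%EOF marker; the "tail"
    is what follows it. Only the first probe_len bytes of the tail are
    searched for. Returns -1 if there is no marker, no tail, or no match.
    """
    idx = data.rfind(EOF_MARKER)
    if idx == -1:
        return -1
    body = data[:idx]
    tail = data[idx + len(EOF_MARKER):].lstrip()
    if not tail:
        return -1
    return body.find(tail[:probe_len])


# Example use (the path is a placeholder):
#   from pathlib import Path
#   print(find_tail_in_body(Path("From WebLink.pdf").read_bytes()))
```

A non-negative result means the start of the appended data also appears earlier in the file, which would match what is described above; -1 does not rule out a partial or shifted repeat.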

replied on March 2, 2018

Thanks Brian. Are you able to reproduce this issue with this document?  I'm just curious whether it's our environment somehow; it seems like it would be a larger issue if it were widespread.

 

replied on March 2, 2018

Hello Shaun,

 

To troubleshoot your issue further, would you be able to open a support case through your reseller? Through that channel we can investigate the issue you are seeing in more detail.

 

Regards,

replied on March 2, 2018

Thanks Andrew, I've opened a ticket (193135).

 

replied on April 27, 2018

Hello,

We have the same problem, with exactly the same behavior Shaun described above. It happens with some PDF documents.

Is it going to be solved in the next WebLink 10.x release?

 

Regards
