Bryce, IRS PDF files: I brought down the latest win build (071229) and repeated the tests on the irs.gov pdf files a few times. Results:
fw4. Runtime error.
fw9. Imports as US Legal sized doc. No issues.
fss4. Imports as US Legal sized doc. No issues.
f1040. Runtime error.
f1040 repeat. Runtime error.
f1040 repeat. Runtime error.
f1040sab. Runtime error.
f1040sab repeat. Runtime error.
f1040sab repeat. Runtime error.
fw4 repeat. Runtime error.
fw9 repeat. Imports as US Legal sized doc. No issues.
fw9 repeat. Imports as US Legal sized doc. No issues.
fw4 repeat. Runtime error.
fw4 repeat. Runtime error.
fw4 repeat. Runtime error.
fw4 repeat with precision at 246. Runtime error.
fw4 repeat with embed images unset. Runtime error.
These results match the previous version with 2/5 files importing. 40% success rate.
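For anyone wanting to keep a running tally as more files get tested, here is a rough Python sketch of the arithmetic behind that figure. The dictionary just restates the first-attempt outcomes listed above; the helper name success_rate is made up for illustration and is not part of Inkscape or any test tool.

```python
# Outcome of the first import attempt for each IRS file, as reported above.
# True means the file imported; False means Inkscape hit a runtime error.
first_attempt = {
    "fw4": False,
    "fw9": True,
    "fss4": True,
    "f1040": False,
    "f1040sab": False,
}


def success_rate(results):
    """Return (imported, total, percent) for a mapping of file -> imported flag."""
    imported = sum(1 for ok in results.values() if ok)
    total = len(results)
    return imported, total, 100.0 * imported / total


imported, total, percent = success_rate(first_attempt)
print(f"{imported}/{total} files imported ({percent:.0f}% success rate)")
```

Adding further entries to the dictionary (for example the five non-IRS files) would give the combined rate in the same way.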
Then I tested several other files that I have on hand and have sources for.
1. canon_lbp800_getting_started_guide.pdf This is a 22 page manual (427KB) available at http://canon.com.au/products/printers/laser_printers_low_medium_volume/lbp80... This file opened at page 1. It came in as a group of 10 objects. Issues: Text import appears complete and properly positioned, at least for page 1. However, there are some formatting and font handling issues. Bolding and italics are present in both Inkscape and A Reader, but the font appears different.
2. 0707news. This is a 6 page document with complex text, available at http://www.bluemountains.org.au/HutNews/0707news.pdf Page 1 came in almost identical to A Reader. There are font/formatting issues but these are minor. Graphics were perfect.
3. code_en.pdf. This is a 4 page document available from http://www.dhamma.org/en/docs/core/code-en.pdf Issues: Text import appears complete and properly positioned, at least for page 1. However, formatting and font handling are off. There is bolding but italics are not present in Inkscape. Kerning is woeful or non-existent in the Inkscape version. It came in as a group of 18 objects.
4. ref_guide_eng.pdf. This 22 page document is available from http://www.gpstm.com/download/ref_guide_eng.pdf This came in as a group of 1 object. It appears to be identical with the A Reader version with respect to text and colour.
5. http://www.sydneyrockies.org.au/pdfs/evans_crown_submission_jan_2007.pdf This document is a 4 page document that opened at page 1. Text imported complete and properly positioned. Bolding and font issues.
5/5, i.e. 100% import success rate. I also imported numerous other pdf files with no problems. In fact these irs.gov files are the only ones I have had trouble with.
Given the variation in settings that people can use to set up PDF files in the first place, I think that Inkscape is doing a fantastic job at present. We are certainly importing graphic PDFs and most text that I have tried.
cheers, Erik kaver@...68...
----- Original Message -----
From: "Bryce Harrington" <bryce@...961...>
To: "kaver" <kaver@...68...>
Cc: inkscape-devel@lists.sourceforge.net
Sent: Monday, December 31, 2007 2:23 PM
Subject: Re: [Inkscape-devel] Does PDF Import actually work?
On Mon, Dec 31, 2007 at 02:01:27PM +1100, kaver wrote:
Bryce, Yes..it does....sort of.
I'm wondering if there are configure options needed to build it in? E.g. is --enable-poppler-cairo required?
PDF opening and import seem a bit buggy. I am using build 0712241827 with win2000 and have looked at the irs site you mentioned. I find:
w4 does not import and throws the message "Runtime error-abnormal program termination".
w9 does import.
1040 does not import.
ss-4 does import.
schedA&B (1040) does not import.
Hrm, only 50/50 success on IRS PDF's is a bit worrisome.
The PDF audience dwarfs the SVG audience, and I worry we could be inviting an inundation of bug reports. Maybe we should mark this feature "EXPERIMENTAL"?
I also tried to open/import several other text pdfs that I had lying around. All of these opened. They included several single page documents and a couple of 166 page documents. Only the first page of the latter came out but there was no termination of Inkscape.
Awesome, thanks for testing this. Would you be willing to assemble a collection of PDFs you've tested, and list the issues (if any) encountered with them?
I do notice that in all cases the "pdf import settings" window appears to show the document. This includes w4 and 1040 etc.
Okay, my issue is probably just some configurational thing. Good to know the import works generally; can't wait to test it!
cheers, Erik
I also notice that my Adobe Updater pretty frequently is giving me updates to my Adobe Reader. Today's for example was 8.1.1 Update. Maybe the goalposts are in motion.
Such is life in Open Source when chasing the big boys. ;-)
Bryce