World: r3wp
[PDF-Maker] discuss Gabriele's pdf-maker
older newer | first last |
DaveC 7-Jun-2007 [537x2] | Haven't had change to look at V2 yet. |
change = chance. | |
btiffin 25-Jun-2007 [539] | Before I look into it, I thought I'd just ask. We (the graphic desgner actually) got sent a logo graphic inside a PDF (and that is all that's in the PDF). Is there an easy way of extracting the image? I'm not a huge fan of selecting and cut'n'paste as it may lose sizing etc... More curious than worried. |
Graham 25-Jun-2007 [540x2] | text editor |
:) | |
btiffin 25-Jun-2007 [542] | It's a binary, but I've extracted it. Thanks for the hint. |
Chris 25-Jun-2007 [543x2] | Depends -- do you want to retain vector artwork? If not, you could rasterise it in Photoshop, GIMP, Seashore or other bitmap editors. |
Ah, you got it... | |
btiffin 25-Jun-2007 [545] | Well, I hacked it...and have no clue if what I did would work the next time :) |
Graham 25-Jun-2007 [546x2] | postscript and pdf are similar formats and store images as either binary or as text. |
unless it's one of those compressed pdf formats | |
Rondon 6-Mar-2008 [548] | Folks, I'm having a problem. How can I extract just the web links inside a pdf ? |
Anton 26-Mar-2008 [549] | There are tools (which I know of) in linux such as pdftotext. I presume the web links would appear in the output text, then you can use rebol parse to extract the links from that. |
amacleod 18-Apr-2008 [550] | Is there any way to go from pdf to makedoc format? |
Graham 19-Apr-2008 [551] | no |
amacleod 19-Apr-2008 [552] | Is a pdf file compressed or encrypted. Is there anyway to parse it? |
Reichart 19-Apr-2008 [553x2] | Depends on the PDF editor and how it was saved. Yes, it can be compressed. Yes, it can be protected (encrypted). It can also be just PostScript at its core. |
Also, keep in mind, PDF can contain all sorts of junk... Go here http://en.wikipedia.org/wiki/Portable_Document_Format Jump to PostScript.... | |
Pavel 24-Apr-2008 [555] | Reichart PDF was created in the way no executable postript code could be possible (no if no jumps no execute AND no write to disc) That is main advatage of PDF. Anyway it is possible to PDF contains tons of shit never would be imaged on page (old versions of documents for example). |
Alan 14-Sep-2008 [556] | . |
amacleod 8-Dec-2008 [557] | How tuff would it be to parse out the image data from a pdf file...or for that matter, the text. Formating not important... Or does anyone know of a good lib or dll that does this? |
Graham 8-Dec-2008 [558x4] | You can't |
unless the text is embedded in the pdf ... in which case there are utilties to pull it out. | |
if it's all grahphic ... eg. a scanned pdf .. then you need to ocr it. | |
I ocr pdfs all the time .... because my scanner produces PDFs | |
Robert 9-Dec-2008 [562] | Can PDF-Maker be used to create PDF-Forms that can be filled out by users with Acrobat Reader? |
Graham 9-Dec-2008 [563] | don't think so ... |
Gabriele 10-Dec-2008 [564] | Robert, I don't have any support for forms yet. |
Robert 10-Dec-2008 [565x2] | Ok. I don't know a lot about PDF files, but using the Adobe tools to make forms is a pain in the ... |
Doing this via a simple dialect would be very cool. Could become a nice tool. | |
amacleod 17-Dec-2008 [567x2] | Graham, Thanks for the reply...I've been working on the house so my computer has been down for a few days...anyway: I have found some utilites to extract images and text but I'm building a tool in rebol to coordinate the conversion of pdf's to text and extracted images, to edit them, and to upload. I was hopig for a rebol solution to incorporate it all into one app. The tool will be used by non-tech so I wanted it simple as possible. How do these extaction utilities work? Are they no parsing out the image data? |
Are they not parsing out the image data? | |
Graham 17-Dec-2008 [569] | goodluck is all I can say. |
Gabriele 18-Dec-2008 [570] | depending on how the PDF file is built, extracting text and images could be trivial, or very difficult. |
amacleod 18-Dec-2008 [571] | Looking at the files in a text editor I believe I can see were image data but where it exactly starts and ends I'm not sure. I tried to cut and past the data for a small image but I could not load it in rebol...just took a shot to see if it were in some staight forward binary format. |
kib2 13-Feb-2009 [572] | Hi. Is this project still alive ? Also, I would like to know if it's possible to render mathematical formulas inside (llike in TeX or Lout ). Thanks. |
Gabriele 14-Feb-2009 [573x2] | Define "alive". :-) |
I haven't spent any time on formulas. I don't have a need for that, and since it's quite time consuming to do it as good as TeX, I never really planned doing it. | |
kib2 14-Feb-2009 [575x2] | Gabriele: good to know it's alive, I was asking the question because the pages were not updated till 2006. Is that normal ? |
I suppose it must be a hard task to implement math rendering, as they may be nested. | |
Henrik 14-Feb-2009 [577] | it's probably better to go the tex->dvi->pdf route. |
Gabriele 15-Feb-2009 [578x3] | kib, that's why I asked "define alive" - if you ask whether I am working on it now, then no. if you ask whether i'll fix bugs that are reported, then yes, but you may have to wait. |
it's not hard to render math. it's just that TeX is very well tuned (uses the correct spacing between things), which is just a long list of rules and parameters. One could get that out from the TeX source... otherwise it just takes a long time to tweak all the parameters to get a nice looking result. | |
then, there's the problem of the fonts. TeX has its own fonts which are designed correctly. TTF is not even close to what MetaFont can do, and the standard symbol font is not as good as the TeX fonts. Even if I added a good formula renderer to the pdf maker, without a good font it wouldn't look very good anyway. | |
Janko 9-May-2009 [581] | Hi, are there any simple examples to get started with PDFM2 ? |
Graham 9-May-2009 [582] | Isn't there an example with the distribution? |
Janko 9-May-2009 [583] | you mean test.pdf ? |
Graham 9-May-2009 [584] | I presume so ... I haven't looked at pdfm2 for years |
Janko 9-May-2009 [585] | I am looking at that and trying to learn how to do things, but it's quite complex.. well I got some of the basics out already and am slowly forming my own pdf |
Janko 10-May-2009 [586] | I got to the point where document looks more like it should at the end so it will be doable :) ... still havent figured out how to set a fill color on something like a box or a table cell/row |
older newer | first last |