World: r3wp

[PDF-Maker] discuss Gabriele's pdf-maker

DaveC
7-Jun-2007
[537x2]
Haven't had change to look at V2 yet.
change = chance.
btiffin
25-Jun-2007
[539]
Before I look into it, I thought I'd just ask.  We (the graphic designer 
actually) got sent a logo graphic inside a PDF (and that is all that's 
in the PDF).  Is there an easy way of extracting the image?  I'm 
not a huge fan of selecting and cut'n'paste as it may lose sizing 
etc...  More curious than worried.
Graham
25-Jun-2007
[540x2]
text editor
:)
btiffin
25-Jun-2007
[542]
It's a binary, but I've extracted it.  Thanks for the hint.
Chris
25-Jun-2007
[543x2]
Depends -- do you want to retain vector artwork?  If not, you could 
rasterise it in Photoshop, GIMP, Seashore or other bitmap editors.
Ah, you got it...
btiffin
25-Jun-2007
[545]
Well, I hacked it...and have no clue if what I did would work the 
next time :)
Graham
25-Jun-2007
[546x2]
postscript and pdf are similar formats and store images as either 
binary or as text.
unless it's one of those compressed pdf formats
Rondon
6-Mar-2008
[548]
Folks, I'm having a problem. How can I extract just the web links 
inside a pdf ?
Anton
26-Mar-2008
[549]
There are tools (that I know of) on Linux, such as pdftotext.  I 
presume the web links would appear in the output text; then you can 
use REBOL's parse to extract the links from that.
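
A minimal sketch of the second half of Anton's suggestion, in Python rather than REBOL, assuming the text has already been produced by something like `pdftotext input.pdf output.txt`. The helper name `extract_links` and the URL pattern are illustrative assumptions, not part of any tool mentioned here. Links that exist only as clickable annotations, and are not printed in the visible text, would probably not show up in that output.

```python
import re

# Hypothetical pattern: an http(s) URL runs until whitespace or a closing delimiter.
URL_RE = re.compile(r"https?://[^\s<>\"')\]]+")

def extract_links(text):
    """Return the unique URLs found in text, in order of first appearance."""
    links = []
    for match in URL_RE.finditer(text):
        # Drop punctuation that usually belongs to the sentence, not the URL.
        url = match.group(0).rstrip(".,;:")
        if url not in links:
            links.append(url)
    return links
```

Calling `extract_links(open("output.txt").read())` on the converted text would then give the list of links to work with.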
amacleod
18-Apr-2008
[550]
Is there any way to go from pdf to makedoc format?
Graham
19-Apr-2008
[551]
no
amacleod
19-Apr-2008
[552]
Is a PDF file compressed or encrypted? Is there any way to parse it?
Reichart
19-Apr-2008
[553x2]
Depends on the PDF editor and how it was saved.
Yes, it can be compressed.  Yes, it can be protected (encrypted).
It can also be just PostScript at its core.
Also, keep in mind, PDF can contain all sorts of junk...

Go here http://en.wikipedia.org/wiki/Portable_Document_Format
Jump to PostScript....
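
A rough way to check which of these cases applies to a given file, sketched in Python. It assumes the common layout where compressed content sits between the `stream` and `endstream` keywords and is marked with `/FlateDecode`, and where an encrypted file carries an `/Encrypt` entry. The function names are made up for this illustration, and the byte scanning is a heuristic, not a real PDF parser.

```python
import re
import zlib

def describe_pdf(data):
    """Report crude indicators of encryption and Flate compression."""
    return {
        "encrypted": b"/Encrypt" in data,
        "flate_streams": data.count(b"/FlateDecode"),
    }

def inflate_streams(data):
    """Yield the decompressed body of every stream that zlib can inflate."""
    for match in re.finditer(rb"stream\r?\n(.*?)endstream", data, re.DOTALL):
        try:
            # decompressobj ignores the line break left before "endstream".
            yield zlib.decompressobj().decompress(match.group(1))
        except zlib.error:
            # Not zlib data (for example a JPEG image), so skip it.
            continue
```

If `describe_pdf` reports the file as encrypted, the inflated output will not be readable, because the stream contents would need to be decrypted first.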
Pavel
24-Apr-2008
[555]
Reichart, PDF was designed so that no executable PostScript code is 
possible (no ifs, no jumps, no execution AND no writing to disk). That 
is the main advantage of PDF. Anyway, it is possible for a PDF to contain 
tons of shit that would never be rendered on a page (old versions of 
documents, for example).
Alan
14-Sep-2008
[556]
.
amacleod
8-Dec-2008
[557]
How tough would it be to parse out the image data from a pdf file...or 
for that matter, the text. Formatting not important...
Or does anyone know of a good lib or dll that does this?
Graham
8-Dec-2008
[558x4]
You can't
unless the text is embedded in the pdf ... in which case there are 
utilities to pull it out.
if it's all graphic ... eg. a scanned pdf .. then you need to ocr 
it.
I ocr pdfs all the time .... because my scanner produces PDFs
Robert
9-Dec-2008
[562]
Can PDF-Maker be used to create PDF-Forms that can be filled out 
by users with Acrobat Reader?
Graham
9-Dec-2008
[563]
don't think so ...
Gabriele
10-Dec-2008
[564]
Robert, I don't have any support for forms yet.
Robert
10-Dec-2008
[565x2]
Ok. I don't know a lot about PDF files, but using the Adobe tools 
to make forms is a pain in the ...
Doing this via a simple dialect would be very cool. Could become 
a nice tool.
amacleod
17-Dec-2008
[567x2]
Graham, Thanks for the reply...I've been working on the house so 
my computer has been down for a few days...anyway:

I have found some utilities to extract images and text but I'm building 
a tool in rebol to coordinate the conversion of pdf's to text and 
extracted images, to edit them, and to upload. I was hoping for a 
rebol solution to incorporate it all into one app. The tool will 
be used by non-tech so I wanted it as simple as possible.

How do these extraction utilities work? Are they no parsing out the 
image data?
Are they not parsing out the image data?
Graham
17-Dec-2008
[569]
good luck is all I can say.
Gabriele
18-Dec-2008
[570]
depending on how the PDF file is built, extracting text and images 
could be trivial, or very difficult.
amacleod
18-Dec-2008
[571]
Looking at the files in a text editor I believe I can see where the 
image data is, but exactly where it starts and ends I'm not sure. I 
tried to cut and paste the data for a small image but I could not load 
it in rebol...just took a shot to see if it were in some straightforward 
binary format.
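
For the simplest case, here is a hedged Python sketch of where such data might start and end. It assumes the image is stored as a JPEG, which PDF marks with the `/DCTDecode` filter, so the stream body is a complete JPEG file between the `stream` and `endstream` keywords. Images stored with `/FlateDecode` are compressed raw pixel samples rather than a loadable image file, which may be why a pasted block would not load. The name `extract_jpegs` is invented for this example, and splitting on `endobj` is a heuristic that a real parser would replace with the cross-reference table.

```python
import re

def extract_jpegs(data):
    """Return the raw bytes of every JPEG (DCTDecode) image stream found."""
    images = []
    for chunk in data.split(b"endobj"):
        keyword = re.search(rb"stream\r?\n", chunk)
        if keyword is None:
            continue  # this object has no stream
        dictionary = chunk[:keyword.start()]
        body = chunk[keyword.end():]
        if b"/DCTDecode" not in dictionary:
            continue  # not stored as a JPEG
        start = body.find(b"\xff\xd8")   # JPEG start-of-image marker
        end = body.rfind(b"\xff\xd9")    # JPEG end-of-image marker
        if start != -1 and end > start:
            images.append(body[start:end + 2])
    return images
```

The file has to be read in binary mode, for example `open("logo.pdf", "rb").read()`, since copying the bytes through a text editor can alter them.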
kib2
13-Feb-2009
[572]
Hi. Is this project still alive? Also, I would like to know if it's 
possible to render mathematical formulas inside (like in TeX or 
Lout). Thanks.
Gabriele
14-Feb-2009
[573x2]
Define "alive". :-)
I haven't spent any time on formulas. I don't have a need for that, 
and since it's quite time consuming to do it as good as TeX, I never 
really planned doing it.
kib2
14-Feb-2009
[575x2]
Gabriele: good to know it's alive, I was asking the question because 
the pages have not been updated since 2006. Is that normal?
I suppose it must be a hard task to implement math rendering, as 
formulas may be nested.
Henrik
14-Feb-2009
[577]
it's probably better to go the tex->dvi->pdf route.
Gabriele
15-Feb-2009
[578x3]
kib, that's why I asked "define alive" - if you ask whether I am 
working on it now, then no. if you ask whether i'll fix bugs that 
are reported, then yes, but you may have to wait.
it's not hard to render math. it's just that TeX is very well tuned 
(uses the correct spacing between things), which is just a long list 
of rules and parameters. One could get that out from the TeX source... 
otherwise it just takes a long time to tweak all the parameters to 
get a nice looking result.
then, there's the problem of the fonts. TeX has its own fonts which 
are designed correctly. TTF is not even close to what MetaFont can 
do, and the standard symbol font is not as good as the TeX fonts. 
Even if I added a good formula renderer to the pdf maker, without 
a good font it wouldn't look very good anyway.
Janko
9-May-2009
[581]
Hi, are there any simple examples to get started with PDFM2 ?
Graham
9-May-2009
[582]
Isn't there an example with the distribution?
Janko
9-May-2009
[583]
you mean test.pdf ?
Graham
9-May-2009
[584]
I presume so ... I haven't looked at pdfm2 for years
Janko
9-May-2009
[585]
I am looking at that and trying to learn how to do things, but it's 
quite complex.. well I got some of the basics out already and am 
slowly forming my own pdf
Janko
10-May-2009
[586]
I got to the point where the document looks more like it should at the 
end so it will be doable :) ... still haven't figured out how to set 
a fill color on something like a box or a table cell/row