Get fonts used in a PDF document with Adobe Acrobat

intoto · August 8, 2011, 1:43pm

Does anyone know if it is possible to return a list of fonts being used in a PDF document using Acrobat?

I mean the list shown under the Fonts tab of ‘File’ > ‘Properties…’.

There is very little information in the Adobe Acrobat Professional Dictionary when searching for ‘fonts’ or ‘properties’.

Thanks in advance…

divster · August 9, 2011, 12:50pm

I’ve never scripted Acrobat before, but as you say the reference is a bit sketchy.

A clunky workaround might be to open the PDF in TextEdit and search for this kind of thing:
/FontName /GIFTLG+ZapfChancery-Italic /

Font names are preceded by /FontName /, so it shouldn’t be too difficult to extract that info using AppleScript.

intoto · August 9, 2011, 2:29pm

Thanks for that.

It got me thinking… A (probably) less ‘clunky’ solution could be:-

set myDoc to (choose file with prompt "Choose a pdf")
set pathToDoc to POSIX path of myDoc

set allFonts to do shell script "grep -a 'FontName' ../../" & pathToDoc

giving the following:-

"
endstream
endobj
43 0 obj
<</StemV 51/FontName/CGNPHB+Courier/FontFile2 42 0 R/Flags 35/Descent -246/FontBBox[-28 -250 628 805]/Ascent 753/FontFamily(Courier)/CapHeight 562/XHeight 426/Type/FontDescriptor/ItalicAngle 0/StemH 51>>
endobj
44 0 obj
<</Subtype/Type1C/Length 877/Filter/FlateDecode>>stream

endstream
endobj
45 0 obj
<</StemV 142/FontName/CGJNAA+HelveticaNeueLTPro-Bd/FontFile3 44 0 R/Flags 262176/Descent 0/FontBBox[-166 -218 1078 975]/Ascent 0/CapHeight 0/Type/FontDescriptor/ItalicAngle 0/StemH 107/CharSet(/hyphen/M/a/j/o/r/c)>>
endobj
46 0 obj
<</Subtype/Type1C/Length 450/Filter/FlateDecode>>stream

endstream
endobj
47 0 obj
<</StemV 108/FontName/CGJNAB+HelveticaNeueLTPro-MdCn/FontFile3 46 0 R/Flags 262176/Descent 0/FontBBox[-164 -216 1031 951]/Ascent 0/CapHeight 0/Type/FontDescriptor/ItalicAngle 0/StemH 86/CharSet(/space/hyphen/bar)>>
endobj
48 0 obj
<</Subtype/Type1C/Length 2125/Filter/FlateDecode>>stream

endstream
endobj
49 0 obj
<</StemV 138/FontName/CGJNAC+HelveticaNeueLTPro-BdCn/FontFile3 48 0 R/Flags 262176/Descent -184/FontBBox[-164 -224 1066 961]/Ascent 0/CapHeight 0/Type/FontDescriptor/ItalicAngle 0/StemH 106/CharSet(/space/hyphen/zero/eight/four/one/two/five/nine/w/period/t/h/o/m/a/s/c/k/V/i/y/u/r/l/v/e/g/n)>>
endobj
50 0 obj
<</Subtype/Type1C/Length 574/Filter/FlateDecode>>stream

endstream
endobj
51 0 obj
<</StemV 85/FontName/CGJNBE+HelveticaNeueLTPro-Roman/FontFile3 50 0 R/Flags 32/Descent 0/FontBBox[-166 -214 1076 952]/Ascent 0/CapHeight 0/Type/FontDescriptor/ItalicAngle 0/StemH 75/CharSet(/hyphen/four/three)>>
endobj
52 0 obj
<</Subtype/Type1C/Length 2912/Filter/FlateDecode>>stream

endstream
endobj
53 0 obj
<</StemV 0/FontName/CGNPDL+HelveticaNeueLTPro55Roman/FontFile3 52 0 R/Flags 4/Descent -210/FontBBox[-156 -208 1072 968]/Ascent 714/CapHeight 0/Type/FontDescriptor/ItalicAngle 0/CharSet(/D/o/n/quoteright/t/f/l/y/s/period/B/k/a/e/w/i/h/u/r/m/d/p/g/four)>>
endobj
54 0 obj
<</Length 297/Filter/FlateDecode>>stream

endstream
endobj
57 0 obj
<</StemV 0/FontName/CGNPEN+HelveticaNeueLTPro75Bold/FontFile3 56 0 R/Flags 4/Descent 0/FontBBox[-156 -208 1072 979]/Ascent 0/CapHeight 0/Type/FontDescriptor/ItalicAngle 0/CharSet(/C/h/o/s/e/y/u/r/a/t/sterling/one/zero/D/v/S/i/ampersand/p)>>
endobj
58 0 obj
<</Length 298/Filter/FlateDecode>>stream

endstream
endobj
61 0 obj
<</StemV 0/FontName/CGNPEO+HelveticaNeueLTPro57Condensed/FontFile3 60 0 R/Flags 4/Descent -184/FontBBox[-156 -208 1000 937]/Ascent 714/CapHeight 0/Type/FontDescriptor/ItalicAngle 0/CharSet(/A/c/o/m/d/a/t/i/n/R/y/p/e/N/s/two/nine/M/r/hyphen/one/eight/three/zero/six/seven/J/u/four/l/five/g/S/O/C/colon/parenleft/h/parenright/U/f/sterling/H/b/comma/w/ampersand/F/k/v/period/P/quoteleft/Y/quoteright/I/B)>>
endobj
62 0 obj
<</Length 347/Filter/FlateDecode>>stream

endstream
endobj
65 0 obj
<</StemV 0/FontName/CGNPEP+HelveticaNeueLTPro77BoldCondensed/FontFile3 64 0 R/Flags 4/Descent -184/FontBBox[-156 -250 1062 947]/Ascent 714/CapHeight 0/Type/FontDescriptor/ItalicAngle 0/CharSet(/two/zero/one/P/R/I/C/E/S/F/O/M/H/L/D/s/t/h/i/l/d/n/e/f/c/a/r/g/A/b/o/m/three/five/nine/eight/four/six/seven/hyphen/u/p/slash/colon/v/k/period/sterling/quoteright/comma/y/w/q)>>
endobj
66 0 obj
<</Length 316/Filter/FlateDecode>>stream"

But even this is hardly a list of just the Fonts used in a document.

Does anyone know of a quick way to strip this down?

Many Thanks.

intoto · August 9, 2011, 2:42pm

I now have:-

set myDoc to (choose file with prompt "Choose a pdf")
set pathToDoc to POSIX path of myDoc

set allFonts to do shell script "tr '\\r' '\\n' < ../../" & pathToDoc & " | grep -a 'FontName'"

Resulting in:-

“<</StemV 0/FontName/CHDCMG+HelveticaNeueLTPro55Roman/FontFile3 44 0 R/Flags 4/Descent -210/FontBBox[-156 -208 1072 968]/Ascent 714/CapHeight 0/Type/FontDescriptor/ItalicAngle 0/CharSet(/D/o/n/quoteright/t/f/l/y/s/period/B/k/a/e/w/i/h/u/r/m/d/p/g/four)>>
<</StemV 0/FontName/CHDCMH+HelveticaNeueLTPro75Bold/FontFile3 48 0 R/Flags 4/Descent 0/FontBBox[-156 -208 1072 979]/Ascent 0/CapHeight 0/Type/FontDescriptor/ItalicAngle 0/CharSet(/C/h/o/s/e/y/u/r/a/t/sterling/one/zero/P/l/n/i/b/R)>>
<</StemV 0/FontName/CHDCMI+HelveticaNeueLTPro57Condensed/FontFile3 52 0 R/Flags 4/Descent -184/FontBBox[-156 -208 1000 937]/Ascent 714/CapHeight 0/Type/FontDescriptor/ItalicAngle 0/CharSet(/A/c/o/m/d/a/t/i/n/R/y/p/e/N/s/two/nine/M/r/hyphen/one/eight/three/zero/six/seven/J/u/four/l/five/g/S/O/C/colon/parenleft/h/parenright/F/b/f/sterling/comma/ampersand/V/k/v/period/P/w/quoteleft/Y/H/quoteright/I/B)>>
<</StemV 0/FontName/CHDCMJ+HelveticaNeueLTPro77BoldCondensed/FontFile3 56 0 R/Flags 4/Descent -184/FontBBox[-156 -250 1062 947]/Ascent 714/CapHeight 0/Type/FontDescriptor/ItalicAngle 0/CharSet(/two/zero/one/P/R/I/C/E/S/F/O/M/H/L/D/s/t/h/i/l/d/n/a/f/b/o/r/A/eight/V/g/e/m/four/nine/six/five/three/seven/hyphen/u/p/slash/c/colon/v/k/period/sterling/quoteright/comma/y/w/q)>>
<</StemV 51/FontName/CHDDEP+Courier/FontFile2 60 0 R/Flags 35/Descent -246/FontBBox[-28 -250 628 805]/Ascent 753/FontFamily(Courier)/CapHeight 562/XHeight 426/Type/FontDescriptor/ItalicAngle 0/StemH 51>>
<</StemV 142/FontName/CHBOLM+HelveticaNeueLTPro-Bd/FontFile3 62 0 R/Flags 262148/Descent 0/FontBBox[-166 -218 1078 975]/Ascent 714/CapHeight 0/Type/FontDescriptor/ItalicAngle 0/StemH 107/CharSet(/space/hyphen/M/a/j/o/r/c/P/l/e/n/t/i/C/u/b/R/s/H/d/y/v/A/m/L/F)>>
<</StemV 114/FontName/CHBOLN+HelveticaNeueLTPro-Md/FontFile3 66 0 R/Flags 4/Descent -205/FontBBox[-165 -221 1066 952]/Ascent 714/CapHeight 714/XHeight 517/Type/FontDescriptor/ItalicAngle 0/StemH 90/CharSet(/space/hyphen/A/c/t/i/v/e/a/p/l/I/m/r/s/w/F/u/n/f/o/k/d/T/h/P/C/b/R/g/comma/x/period/quotesingle/y/j/L/V/parenleft/two/four/parenright/H/B/O/bullet/G/zero/D/sterling/one/six/eight/M/slash/S/ampersand/E)>>
<</StemV 170/FontName/CHBOMP+HelveticaNeueLTPro-Hv/FontFile3 70 0 R/Flags 262148/Descent 0/FontBBox[-169 -234 1096 951]/Ascent 714/CapHeight 0/Type/FontDescriptor/ItalicAngle 0/StemH 144/CharSet(/space/hyphen/O/u/r/t/o/p/m/a/k/s/A/d/e/v/l/Y/i/w/I/n/c)>>
<</StemV 198/FontName/CHBONA+HelveticaNeueLTPro-Blk/FontFile3 74 0 R/Flags 262176/Descent 0/FontBBox[-165 -232 1101 953]/Ascent 0/CapHeight 0/Type/FontDescriptor/ItalicAngle 0/StemH 144/CharSet(/hyphen/nine/five/percent/three/eight/seven)>>
<</StemV 63/FontName/CHBONB+HelveticaNeueLTPro-LtIt/FontFile3 76 0 R/Flags 96/Descent -206/FontBBox[-165 -214 1099 951]/Ascent 714/CapHeight 0/Type/FontDescriptor/ItalicAngle -12/StemH 53/CharSet(/space/hyphen/F/e/a/t/u/r/s/h/o/w/n/b/j/c/g/d/i/endash/quotesingle/A/l/I/v/y/p/three/one/seven/f/period)>>
<</StemV 63/FontName/CHBOOC+HelveticaNeueLTPro-Lt/FontFile3 78 0 R/Flags 4/Descent -206/FontBBox[-166 -214 1050 967]/Ascent 714/CapHeight 714/XHeight 516/Type/FontDescriptor/ItalicAngle 0/StemH 53/CharSet(/space/hyphen/H/O/L/I/D/A/Y/S/W/T/E/m/a/l/s/t/r/e/c/h/o/f/b/p/x/period/five/zero/w/y/semicolon/two/k/P/u/n/i/eight/nine/g/d/v/parenleft/parenright/comma/agrave/F/q/colon/quotesingle/one/four/V/M/three/B/j/C/ampersand/G)>>
<</StemV 162/FontName/CHBOOD+HelveticaNeueLTPro-HvCn/FontFile3 82 0 R/Flags 262176/Descent 0/FontBBox[-166 -230 1100 975]/Ascent 0/CapHeight 714/Type/FontDescriptor/ItalicAngle 0/StemH 124/CharSet(/space/hyphen/A/L/I/N/C/U/S/V/E/O/P/T/W/R/D/F/X)>>
<</StemV 85/FontName/CHBOOE+HelveticaNeueLTPro-Roman/FontFile3 84 0 R/Flags 32/Descent 0/FontBBox[-166 -214 1076 952]/Ascent 0/CapHeight 0/Type/FontDescriptor/ItalicAngle 0/StemH 75/CharSet(/hyphen/four)>>
<</StemV 90/FontName/CHCPEO+ZapfDingbats/FontFile3 86 0 R/Flags 4/Descent 0/FontBBox[-1 -143 981 820]/Ascent 0/CapHeight 0/Type/FontDescriptor/ItalicAngle 0/StemH 28/CharSet(/space/a19/a61)>>”

intoto · August 9, 2011, 3:03pm

I have it :lol: (if anyone is interested):-

set myDoc to (choose file with prompt "Choose a pdf")
set pathToDoc to POSIX path of myDoc

set allFonts to do shell script "tr '\\r' '\\n' < ../../" & pathToDoc & " | grep -a 'FontName' | cut -d '/' -f 4 | cut -d '+' -f 2"

A big thanks to divster for your suggestion. I now don’t even have to open Acrobat!!

Monostratos · August 9, 2011, 3:19pm

Nicely done.
Just a few more kinks: files with spaces in their filenames return nothing, and exotic characters in the filename make the script fail. Any ideas?

intoto · August 9, 2011, 3:35pm

I too found that paths with spaces in them fail, and also that the font name is not always in the same place within the results.
The following addresses this:

set myDoc to (choose file with prompt "Choose a pdf")
set pathToDoc to POSIX path of myDoc

set allFonts to do shell script "tr '\\r' '\\n' < ../../" & quoted form of pathToDoc & " | grep -a 'FontName' | cut -d '+' -f 2 | cut -d '/' -f 1"

Still not sure on the ‘exotic’ characters…
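On the quoting fix: ‘quoted form of’ wraps the path in single quotes and rewrites any embedded single quote as '\'', so spaces and most ‘exotic’ shell characters should reach tr intact. A rough shell equivalent, as a sketch only (shell_quote is a made-up helper name, not part of AppleScript or the scripts above):

```shell
# shell_quote is a hypothetical helper that mimics what AppleScript's
# "quoted form of" does to a string before it is handed to the shell.
shell_quote() {
    # Replace each ' with '\'' and wrap the result in single quotes.
    printf "'%s'\n" "$(printf '%s' "$1" | sed "s/'/'\\\\''/g")"
}

# Example: print the quoted form of a path containing a space and an apostrophe.
shell_quote "/Users/me/Desktop/client's brochure.pdf"
```

If a file name still fails after quoting, the cause is more likely the PDF's contents than the name itself.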

Monostratos · August 9, 2011, 4:00pm

Now it’s working on most pdfs I’ve tried, but it still returns empty on some of them. I’m not sure it’s about the characters in the file name, maybe they’re just structured differently. I could imagine that pdf standards have changed quite a few times.
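One possible reason for the empty results, offered as a guess rather than a diagnosis: since PDF 1.5 a file may pack objects, font descriptors included, into compressed object streams, and then the text ‘FontName’ never appears in the raw bytes for grep to find. A quick check, where pdf_uses_object_streams is a hypothetical helper name:

```shell
# pdf_uses_object_streams is a hypothetical helper, named for illustration.
# It succeeds (exit status 0) when the file declares at least one object stream.
pdf_uses_object_streams() {
    # LC_ALL=C makes grep treat the file as plain bytes; -a forces text mode.
    LC_ALL=C grep -a -q '/Type */ObjStm' "$1"
}

# Example use:
#   if pdf_uses_object_streams "/path/to/file.pdf"; then
#       echo "font descriptors may be hidden inside compressed streams"
#   fi
```

If it succeeds on a PDF that returns no fonts, the descriptors are probably inside compressed streams and a plain text search cannot see them.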

intoto · August 9, 2011, 4:25pm

I have found that some fonts (but not the majority) do not have the 6 characters and ‘+’ in the string returned. As I am using the ‘+’ as a delimiter in the Unix ‘cut’ command, this is giving some odd results.

Fonts in one document are:-

<</StemV 121/FontName/CronosPro-Semibold/FontFile3 98 0 R/Flags 262148/Descent 0/FontBBox[-185 -287 1135 885]/Ascent 0/CapHeight 630/Type/FontDescriptor/ItalicAngle 0/StemH 84>>
<</StemV 51/FontName/PIHEJF+Courier/FontFile2 122 0 R/Flags 35/Descent -246/FontBBox[-28 -250 628 805]/Ascent 753/FontFamily(Courier)/CapHeight 562/XHeight 426/Type/FontDescriptor/ItalicAngle 0/StemH 51>>
<</StemV 84/FontName/CronosPro-Regular/FontFile3 113 0 R/Flags 4/Descent -217/FontBBox[-179 -270 1129 886]/Ascent 679/CapHeight 630/Type/FontDescriptor/ItalicAngle 0/StemH 61>>
<</StemV 0/FontName/IPJGKH+Dingbats-ArrowsTwo/FontFile3 101 0 R/Flags 6/Descent 53/FontBBox[57 -7 783 689]/Ascent 621/XHeight 687/CapHeight 584/Type/FontDescriptor/ItalicAngle 0/CharSet(/hyphen/q/space)>>
<</StemV 88/FontName/CronosPro-Capt/FontFile3 121 0 R/Flags 32/Descent -217/FontBBox[-178 -277 1159 886]/Ascent 678/XHeight 449/CapHeight 0/Type/FontDescriptor/ItalicAngle 0/StemH 62>>

Which when cut using the ‘+’ delimiter gives:-

Problem is, as the FontName is not always in the same position, I cannot use the ‘/’ as a delimiter.

Would looping through each line (paragraphs of allFonts) and taking the text from the offset of “FontName/” + 1 up to the next “/” work?
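For what it is worth, cut passes a line through untouched when the delimiter is missing (unless -s is given), which would explain the odd results for names such as CronosPro-Semibold. The offset idea can be sketched in the shell with parameter expansion. This is only a sketch: font_name_from_line is a hypothetical helper, and it assumes another /Key follows the font name, as in the sample lines above.

```shell
# font_name_from_line is a hypothetical helper, named here for illustration only.
# It prints the font name found on one line of grep output, or fails if the
# line has no "FontName/" key.
font_name_from_line() {
    line=$1
    case $line in
        *FontName/*) ;;        # the key is present, carry on
        *) return 1 ;;         # no font descriptor on this line
    esac
    rest=${line#*FontName/}    # drop everything up to and including "FontName/"
    name=${rest%%/*}           # keep what comes before the next "/"
    name=${name#*+}            # drop a subset tag such as "PIHEJF+"; no "+" means no change
    printf '%s\n' "$name"
}

# Example use:
#   font_name_from_line '<</StemV 51/FontName/PIHEJF+Courier/FontFile2 122 0 R>>'
```

Because the search is anchored on the key itself, it does not matter where FontName sits within the line.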

divster · August 9, 2011, 11:39pm

Hi intoto

Glad you’re having some success. I don’t use grep in a shell script but do use it in InDesign for styles. Hopefully the syntax is similar enough to be useful. If I use the following:

This is positive look behind for /FontName/ followed by any character repeated one or more times shortest match followed by positive look ahead for /

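I can highlight the font names in your example as follows:

CronosPro-Semibold
PIHEJF+Courier
CronosPro-Regular
IPJGKH+Dingbats-ArrowsTwo
CronosPro-Capt

Then it would be easy to chop off the other rubbish with:

\u\u\u\u\u\u\+

This is any upper case character (6 times) followed by + (escaped with a \)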

Hope this helps
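The search described above would presumably be written (?<=/FontName/).+?(?=/) in InDesign, though that is a reconstruction from the description, not a quote. POSIX regular expressions, as used by grep and sed, have no lookbehind or lookahead, but a capture group gives much the same result. A minimal sketch, with extract_font_names and strip_subset_tag as hypothetical names:

```shell
# Both function names are hypothetical; each reads lines on standard input.

# Print only the name that follows /FontName on each line that has one.
# The \( ... \) group plays the role of the text between the lookbehind
# and the lookahead; lines without /FontName are dropped by -n ... p.
extract_font_names() {
    LC_ALL=C sed -n 's|.*/FontName */\([^/ <>]*\).*|\1|p'
}

# Remove a leading tag of six upper case letters plus "+", if present.
# In a basic regular expression the "+" is literal and needs no escape.
strip_subset_tag() {
    LC_ALL=C sed 's|^[A-Z]\{6\}+||'
}

# Example use:
#   grep -a 'FontName' some.pdf | extract_font_names | strip_subset_tag
```

Names without a subset tag, such as CronosPro-Regular, pass through the second step unchanged.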

intoto · August 10, 2011, 8:46am

divster:

I don’t use grep in a shell script but do use it in InDesign for styles. Hopefully the syntax is similar enough to be useful. If I use the following:
This is positive look behind for /FontName/ followed by any character repeated one or more times shortest match followed by positive look ahead for /

I can highlight the font names in your example as follows:

CronosPro-Semibold
PIHEJF+Courier
CronosPro-Regular
IPJGKH+Dingbats-ArrowsTwo
CronosPro-Capt

Then it would be easy to chop off the other rubbish with:

\u\u\u\u\u\u\+

This is any upper case character (6 times) followed by + (escaped with a \)

Hope this helps

Thanks again divster!

I’m not familiar with that ‘kind’ of grep in Unix, but based on your suggestion, and working with Unix grep and regex, I now have the following. It returns everything after FontName/, then loops through every line and, if the line contains the “+” character, ‘cuts’ the prefix off.

set myDoc to (choose file with prompt "Choose a pdf" without invisibles)
set pathToDoc to POSIX path of myDoc

-- Split the file on carriage returns, keep the text from "FontName" to the last "/" on each line, then take the name itself
set allFonts to paragraphs of (do shell script "tr '\\r' '\\n' < ../../" & quoted form of pathToDoc & " | egrep -ao \"FontName.*/\" | cut -d '/' -f 2")

set fontList to {}
repeat with i from 1 to number of items in allFonts
	set thisItem to item i of allFonts
	-- Subset fonts carry a tag such as "PIHEJF+"; keep only the part after the "+"
	if thisItem contains "+" then
		set thisItem to do shell script "echo " & quoted form of thisItem & " | cut -d'+' -f2"
	end if
	set end of fontList to thisItem
end repeat
fontList

I’ve not tested loads of PDFs, but so far so good. Thanks for everyone’s input.
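The same job can probably be done in a single pipeline, which avoids one ‘do shell script’ call per font and also removes repeats when a font is embedded more than once under different subset tags. A sketch under those assumptions, with list_pdf_fonts as a hypothetical function name:

```shell
# list_pdf_fonts is a hypothetical name; it takes the path to a PDF as $1.
list_pdf_fonts() {
    # 1. Turn carriage returns into newlines so each object sits on its own line.
    # 2. Pull out each "/FontName/<name>" token, wherever it sits in the line.
    # 3. Drop the key, then the optional six-letter subset tag and its "+".
    # 4. Print each name once, in order of first appearance.
    # LC_ALL=C makes the tools treat the PDF as plain bytes.
    LC_ALL=C tr '\r' '\n' < "$1" |
        LC_ALL=C grep -a -o '/FontName */[^/ <>]*' |
        LC_ALL=C sed -e 's|^/FontName */||' -e 's|^[A-Z]\{6\}+||' |
        awk '!seen[$0]++'
}

# Example use:
#   list_pdf_fonts "/path/to/file.pdf"
```

From AppleScript the body of the function could be passed to one ‘do shell script’ call, with ‘paragraphs of’ turning the result into a list as before.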

divster · August 10, 2011, 11:27pm

Good work intoto

I’ve saved your script for later, just in case!

Do you know any good resources for learning shell script?

Model: 2 x 2.8 GHz Quad-Core Intel Xeon
AppleScript: 2.1.2
Browser: Firefox 3.6.14
Operating System: Mac OS X (10.6)

intoto · August 11, 2011, 9:31am

Hi divster

http://ss64.com/osx/

is a good site for Mac specific Unix commands. For everything else google search “Mac OS X Unix commands” (Some Linux/Solaris commands are not available in Mac OS X!)

I also refer to the book “Mac OS X UNIX Toolbox - 1000+ Commands for Mac OS X Unix”

Does anyone else have any resources to add?