classification
Title: missing meta-info in documentation pdf
Type: behavior Stage: needs patch
Components: Documentation Versions: Python 3.6, Python 3.5
process
Status: open Resolution:
Dependencies: Superseder:
Assigned To: docs@python Nosy List: ZeD, berker.peksag, docs@python, georg.brandl, sandro.tosi
Priority: normal Keywords:

Created on 2009-05-02 06:42 by ZeD, last changed 2016-04-02 05:45 by berker.peksag.

Messages (5)
msg86931 - (view) Author: Vito De Tullio (ZeD) Date: 2009-05-02 06:42
from http://docs.python.org/download.html and
http://docs.python.org/3.0/download.html you can download the python
documentation in many formats (html, pdf, txt), I think auto-generated
by the .rst source.

While html and txt does not, the pdf format, supports some simple
"meta-data" infos about the document: other than the number of pages,
the page size, the creator, etc... (all auto-filled) there are 4 "keys"
that the current pdf miss to fill and I think it's important: Title,
Subject, Keywords, and Author.

try, for example, to use the simple pdfinfo tool (from
http://poppler.freedesktop.org/) to inspect the current pdf

$ tar xvjf python-3.0.1-docs-pdf-a4.tar.bz2
$ cd docs-pdf
$ ls -1
c-api.pdf
distutils.pdf
documenting.pdf
extending.pdf
howto-advocacy.pdf
howto-cporting.pdf
howto-curses.pdf
howto-doanddont.pdf
howto-functional.pdf
howto-regex.pdf
howto-sockets.pdf
howto-unicode.pdf
howto-urllib2.pdf
howto-webservers.pdf
install.pdf
library.pdf
reference.pdf
tutorial.pdf
using.pdf
whatsnew.pdf
$ pdfinfo library.pdf
Title:
Subject:
Keywords:
Author:
Creator:        LaTeX with hyperref package
Producer:       pdfTeX-1.40.9
CreationDate:   Sat Feb 14 11:33:09 2009
ModDate:        Sat Feb 14 11:33:09 2009
Tagged:         no
Pages:          1077
Encrypted:      no
Page size:      595.276 x 841.89 pts (A4)
File size:      7556857 bytes
Optimized:      no
PDF version:    1.4

erh, what is supposed to contain "using.pdf"? and "distutils.pdf"?

(Yes, I know what they contain, but...)

If the pdf were "tagged", not only me, but also my OS will know what's
in these pdf (see nepomuk/strigi/tracker/beagle programs, helping me
find what I'm searching
msg86932 - (view) Author: Georg Brandl (georg.brandl) * (Python committer) Date: 2009-05-02 07:47
I thought we already did assign these metadata items; looks like it's
been messed up somehow.  I'll fix this.
msg130297 - (view) Author: Sandro Tosi (sandro.tosi) * (Python committer) Date: 2011-03-07 23:32
AFAIK, this is obtained using \pdfinfo{} command[1][2][3]

[1] http://theoval.cmp.uea.ac.uk/~nlct/latex/pdfdoc/pdfdoc/pdfdoc.html
[2] http://tex.stackexchange.com/questions/5958/is-hyperref-really-the-best-way-to-add-metadata-to-a-tex-file
[3] http://www.tug.org/applications/hyperref/manual.html#x1-110003.7

Georg, do you think this is something that sphinx has to handle (in the latex builder) or somewhere else?
msg188834 - (view) Author: A.M. Kuchling (akuchling) * (Python committer) Date: 2013-05-10 14:01
This seems to be fixed now.  I downloaded python-3.3.1-docs-pdf-letter.tar.bz2 and ran pdfinfo on using.pdf:

Title:          Python Setup and Usage
Subject:        
Keywords:       
Author:         Guido van Rossum, Fred L. Drake, Jr., editor
Creator:        LaTeX with hyperref package
Producer:       pdfTeX-1.40.10
CreationDate:   Fri May 10 09:42:17 2013
ModDate:        Fri May 10 09:42:17 2013
Tagged:         no
Pages:          65
Encrypted:      no
Page size:      612 x 792 pts (letter)
File size:      432316 bytes
Optimized:      no
PDF version:    1.4

We could fill in Subject and Keywords, but the basic fields of Title and Author seem to be present.  (Do we need to change author to 'Python Development Group' or 'PSF'?)
msg262788 - (view) Author: Berker Peksag (berker.peksag) * (Python committer) Date: 2016-04-02 05:45
I get the following output when I try Python 3.5.1 docs:

$ pdfinfo using.pdf 
Title:          
Subject:        
Keywords:       
Author:         
Creator:        LaTeX with hyperref package
Producer:       pdfTeX-1.40.14
CreationDate:   Sat Apr  2 00:17:54 2016
ModDate:        Sat Apr  2 00:17:54 2016
Tagged:         no
Pages:          69
Encrypted:      no
Page size:      595.276 x 841.89 pts (A4)
File size:      368343 bytes
Optimized:      no
PDF version:    1.5
History
Date User Action Args
2016-04-02 05:45:19berker.peksagsetversions: + Python 3.5, Python 3.6, - Python 3.3
nosy: + berker.peksag

messages: + msg262788

type: resource usage -> behavior
stage: needs patch
2014-12-31 16:23:00akuchlingsetnosy: - akuchling
2013-05-10 14:01:01akuchlingsetnosy: + akuchling
messages: + msg188834
2011-03-07 23:32:28sandro.tosisetnosy: + sandro.tosi

messages: + msg130297
versions: + Python 3.3, - Python 3.1
2010-07-20 13:45:32BreamoreBoysetassignee: georg.brandl -> docs@python

nosy: + docs@python
2009-05-02 07:47:55georg.brandlsetmessages: + msg86932
2009-05-02 06:42:56ZeDcreate