• What is the best free software for creating & editing PDFs nowadays

    From Marion@21:1/5 to All on Fri Feb 28 20:09:29 2025
    XPost: alt.comp.os.windows-11, comp.editors, comp.text.pdf

    In a recent thread, the perennial topic came up, which needs updating:
    *Software for creating and editing PDFs*
    <https://www.novabbs.com/computers/article-flat.php?id=9874&group=alt.comp.os.windows-11#9874>

    Funny story: Decades ago, in the Silicon Valley, I asked my company IT department to consider PDFs and they wrote back vehemently that they
    researched what PDF was (as it was brand new, but they knew about
    PostScript) and they wrote a scathing denial email saying emphatically that they do NOT want to "support yet another standard" (Microsoft Office being their standard at that time). Heh heh heh... I wish I saved that email...

    It has been a few years... I think collectively we need to update this
    chart of the single best freeware for the stated PDF editing needs...

    [?] Print book format PDF (FinePrint payware)
    [x] Add or concatenate pages (pdftk, acrobat payware)
    [x] Add signature (Adobe Reader Fill-and-sign sign-yourself tool)
    [x] Archive sites (wkhtmltopdf, Acrobat payware,fastone scroll capture)
    [x] Convert PDF to MSWord or any epub format & vice versa (Calibre)
    [x] Create PDF new text (Irfanview or Paint.NET plugins + Ghostscript)
    [x] Edit PDF existing text (Adobe Reader commenting, Acrobat payware)
    [x] Extract images (PDF Exchange Viewer, PDF Shaper)
    [x] Fast PDF reader: (Sumatra or Foxit)
    [x] Globally search & replace PDF text (Libre Office)
    [x] Merge PDFs (pdfsam, pdftk)
    [x] OCR, PDF-Xchange, freeOCR (paperfile.net), GOCR (jocr.sourceforge.net)
    [x] Online shrink PDF https://www.adobe.com/acrobat/online/compress-pdf.html [x] PDF text to audio file (Balabolka)
    [x] Print sans username in the properties (Libre Office Writer)
    [x] Remove pages (pdfsam, pdftk)
    [x] Remove restrictions (Ghostscript & Ghostview with ps2edit & pdfwrite or pdf2djvu)
    [x] Renumber pages (Acrobat Reader)
    [x] Reorder pages (mutool)
    [x] Rotate pages (Acrobat Reader)
    [x] Shrink PDFs (ImageMagick or Acrobat payware or rlvision shareware)
    [x] Tile PDFs (i.e., to print large posters) (Posterazor)
    [?] What other tasks do you do to edit or modify a PDF file?

    What are your suggestions (so that everyone benefits from your knowledge)?

    --- SoupGate-Win32 v1.05
    * Origin: fsxNet Usenet Gateway (21:1/5)
  • From Marion@21:1/5 to Lawrence D'Oliveiro on Fri Feb 28 23:56:37 2025
    XPost: alt.comp.os.windows-11, comp.editors, comp.text.pdf

    On Fri, 28 Feb 2025 21:00:53 -0000 (UTC), Lawrence D'Oliveiro wrote :


    ... I think collectively we need to update this
    chart of the single best freeware for the stated PDF editing needs...

    I prefer Free software to freeware, myself. <https://en.wikipedia.org/wiki/Freeware> <https://en.wikipedia.org/wiki/Free_software>

    Thanks for the clarification where I see, from your links, that "freeware"
    is about cost, while "free software" is about user rights and freedoms.

    While I appreciate the gentle word-use admonition, it's kind of like when I
    ask people to use "lend" as a verb vs "loan" as a noun; or when I notice
    people using "further" for "farther" in terms of distances; or when people
    use "less" instead of "fewer" for things that can be counted; or when
    people use "dirt" when they really mean "soil"; or when they call a "stone"
    a "rock" when all of these things are actually not what people think.

    But not many people know those distinctions, such as what it really means
    for two people to be "Platonic", although it's not as bad as when people
    say "I could care less" when what they mean is the exact opposite feeling.

    Taking your kind advice in hand, I see The key difference between
    "freeware" and "free software" lies in the concept of freedom, not just
    cost, in terms of who retains copyright and controls distribution and modification, specifically the freedom to run the program for any purpose,
    such as to study how the program works and to maybe change it, and maybe
    even redistribute modified versions.

    I accept your suggestion to keep in mind that free software is often
    available at no cost, not all software available at no cost is free
    software.

    With that taken care of to an appropriate level of clarification, what I
    ask the team at large to help out for, is to flesh out this table.

    What else is needed to be done with a PDF file & which programs do it?
    [?] Print book format PDF (FinePrint payware)
    [x] Add or concatenate pages (pdftk, acrobat payware)
    [x] Add signature (Adobe Reader Fill-and-sign sign-yourself tool)
    [x] Archive sites (wkhtmltopdf, Acrobat payware,fastone scroll capture)
    [x] Convert PDF to MSWord or any epub format & vice versa (Calibre)
    [x] Create PDF new text (Irfanview or Paint.NET plugins + Ghostscript)
    [x] Edit PDF existing text (Adobe Reader commenting, Acrobat payware)
    [x] Extract images (PDF Exchange Viewer, PDF Shaper)
    [x] Fast PDF reader: (Sumatra or Foxit)
    [x] Globally search & replace PDF text (Libre Office)
    [x] Merge PDFs (pdfsam, pdftk)
    [x] OCR, PDF-Xchange, freeOCR (paperfile.net), GOCR (jocr.sourceforge.net)
    [x] Online shrink PDF https://www.adobe.com/acrobat/online/compress-pdf.html [x] PDF text to audio file (Balabolka)
    [x] Print sans username in the properties (Libre Office Writer)
    [x] Remove pages (pdfsam, pdftk)
    [x] Remove restrictions (Ghostscript & Ghostview with ps2edit & pdfwrite or pdf2djvu)
    [x] Renumber pages (Acrobat Reader)
    [x] Reorder pages (mutool)
    [x] Rotate pages (Acrobat Reader)
    [x] Shrink PDFs (ImageMagick or Acrobat payware or rlvision shareware)
    [x] Tile PDFs (i.e., to print large posters) (Posterazor)
    [?] What other common tasks do you do to edit or modify a PDF file?

    --- SoupGate-Win32 v1.05
    * Origin: fsxNet Usenet Gateway (21:1/5)
  • From Marion@21:1/5 to Marion on Sat Mar 1 00:21:43 2025
    XPost: alt.comp.os.windows-11, comp.editors, comp.text.pdf

    On Fri, 28 Feb 2025 23:56:37 -0000 (UTC), Marion wrote :


    With that taken care of to an appropriate level of clarification, what I
    ask the team at large to help out for, is to flesh out this table.

    Password protecting a .pdf file.

    Hi Rick,

    I've written tutorials on how to REMOVE PDF restrictions, but I never
    thought about *adding* PDF restrictions, such as password protection.

    So I've added to the chart of things people want to do with a PDF:
    [x] Offline encrypt PDF with a password (pdfencrypt)

    <https://pdfencrypt.net/>
    <https://pdfencrypt.net/files/setup.exe>
    Name: setup.exe
    Size: 5515925 bytes (5386 KiB)
    SHA256: 7D5B37F986EBC374772CB749FAA4B4DA39D466D14A03BEE76B13800E59131992

    What else is needed to do with a PDF that we haven't discussed in the past?

    --- SoupGate-Win32 v1.05
    * Origin: fsxNet Usenet Gateway (21:1/5)
  • From Lawrence D'Oliveiro@21:1/5 to Marion on Fri Feb 28 21:00:53 2025
    XPost: alt.comp.os.windows-11, comp.editors, comp.text.pdf

    On Fri, 28 Feb 2025 20:09:29 -0000 (UTC), Marion wrote:

    ... I think collectively we need to update this
    chart of the single best freeware for the stated PDF editing needs...

    I prefer Free software to freeware, myself.

    <https://en.wikipedia.org/wiki/Freeware> <https://en.wikipedia.org/wiki/Free_software>

    --- SoupGate-Win32 v1.05
    * Origin: fsxNet Usenet Gateway (21:1/5)
  • From Lawrence D'Oliveiro@21:1/5 to Marion on Sat Mar 1 02:53:56 2025
    XPost: alt.comp.os.windows-11, comp.editors, comp.text.pdf

    On Fri, 28 Feb 2025 23:56:37 -0000 (UTC), Marion wrote:

    [x] Add or concatenate pages
    [x] Merge PDFs
    [x] Print sans username in the properties
    [x] Remove pages
    [x] Reorder pages

    Just note that the PikePDF toolkit is very handy for performing all these
    tasks from a Python script. As an example use of it, I wrote this command-
    line tool <https://gitlab.com/ldo/acrid>, which lets you examine and change/add/delete the metadata associated with a PDF file -- both the old- style format and the XMP format.

    --- SoupGate-Win32 v1.05
    * Origin: fsxNet Usenet Gateway (21:1/5)
  • From Marion@21:1/5 to Lawrence D'Oliveiro on Sat Mar 1 06:24:35 2025
    XPost: alt.comp.os.windows-11, comp.editors, comp.text.pdf

    On Sat, 1 Mar 2025 02:53:56 -0000 (UTC), Lawrence D'Oliveiro wrote :


    Just note that the PikePDF toolkit is very handy for performing all these tasks from a Python script. As an example use of it, I wrote this command- line tool <https://gitlab.com/ldo/acrid>, which lets you examine and change/add/delete the metadata associated with a PDF file -- both the old- style format and the XMP format.

    Thanks for the suggestion of PikePDF, which I was wholly unaware of, since
    the list was taken from discussions on the windows newsgroups over time.

    I'm surprised I missed a mention of PDF-related freeware, so I searched.

    First, I searched the c.t.p & a.c.o.w-10 archives for mention of PikePDF:
    <https://groups.google.com/g/comp.text.pdf>
    <https://www.novabbs.com/computers/search.php?group=comp.text.pdf>
    <https://www.novabbs.com/computers/search.php?group=alt.comp.os.windows-10>

    The Google Groups c.t.p search returned zero hits for "PikePDF".
    The Nova BBS c.t.p search returned one hit.
    *How to remove a link in a PDF that is found in a thousand pages*
    <https://www.novabbs.com/computers/article-flat.php?id=363&group=comp.text.pdf#363>
    Likewise, the Nova BBS a.c.o.w-10 search returned that same hit.
    <https://www.novabbs.com/computers/article-flat.php?id=79154&group=alt.comp.os.windows-10#79154>
    The contents are (verbatim, in toto):
    Write a program using a PDF-manipulation toolkit.
    I have had good results writing Python code using pikepdf
    <https://github.com/pikepdf/pikepdf>

    At that web github location is the following description:
    PikePDF: A Python library for reading and writing PDF, powered by QPDF
    Documentation: <https://pikepdf.readthedocs.io/en/latest/>
    "pikepdf is a library intended for developers who want to create,
    manipulate, parse, repair, and abuse the PDF format.
    It supports reading and write PDFs, including creating from scratch.
    Thanks to QPDF, it supports linearizing PDFs and access to
    encrypted PDFs. It is a low level library that requires knowledge
    of PDF internals and some familiarity with the PDF specification.
    It does not provide a user interface of its own."

    First, I had to look up what "QPFD" was:
    <https://sourceforge.net/projects/qpdf/>
    "QPDF is a C++ library and set of programs that inspect & manipulate
    the structure of PDF files. It can encrypt and linearize files,
    expose the internals of a PDF file, and do many other operations
    useful to end users and PDF developers."

    Linearization, by the way, is optimizing (usually for the web).

    Apparently QPDF is intended to perform content-preserving transformations
    of PDF files by changing PDF structures without altering visual contents.

    The description of PikePDF provides the following examples of what it does: [x]Pikepdf would help you build apps that do things like:
    A cartoon sketch of a pike
    [x]Copy pages from one PDF into another
    [x]Split and merge PDFs
    [x]Extract content from a PDF such as images
    [x]Replace content
    such as replacing an image without altering the rest of the file
    [x]Repair, reformat or linearize PDFs
    [x]Change the size of pages and reposition content
    [x]Optimize PDFs similar to Acrobat's features by downsampling images, [x]deduplicating
    [x]Calculate charges for a scanning project based on the materials scanned [x]Alter a PDF to meet a target specification such as PDF/A or PDF/X
    [x]Add or modify PDF metadata
    [x]Add, remove, extract, and modify PDF attachments (i.e. embedded files) [x]Create well-formed but invalid PDFs for testing purposes

    Bingo! *Add or modify PDF metadata*

    OK. I've confirmed what you've explained to us, which is the combination of Python scripts, PikePDF and the underlying QPDF can remove PDF metadata.

    I appreciate your suggested site <https://gitlab.com/ldo/acrid>, which will help the programmers here who can take advantage of your kind offering.

    Even though I go back to the sixties and seventies in programming (COBOL, Fortran77 before there was a IV, IBM Assembly, Motorola 68701, etc.) I
    swore off programming at some point, so I can't make use of these tools.

    But others who know a lot more than I do about programming certainly can.
    I would like to add it to the summary chart but it may be too eclectic.

    A more readily available program to remove metadata might be LibreOffice.
    Also PDFgear online/offline tools <https://www.pdfgear.com/>
    Also PDF24 online tools <https://tools.pdf24.org/en/remove-pdf-metadata>
    Also Sejda online tools <https://www.sejda.com/edit-pdf-metadata>

    This is getting long so let's break off a tangent for metadata removal.
    Suffice to say that removal of metadata is critically important, which
    means it behooves us to find an easy way for everyone to be able to do it.

    --- SoupGate-Win32 v1.05
    * Origin: fsxNet Usenet Gateway (21:1/5)
  • From Marion@21:1/5 to Marion on Sat Mar 1 06:49:51 2025
    XPost: alt.comp.os.windows-11, comp.editors, comp.text.pdf

    On Sat, 1 Mar 2025 06:24:35 -0000 (UTC), Marion wrote :


    A more readily available program to remove metadata might be LibreOffice. Also PDFgear online/offline tools <https://www.pdfgear.com/>
    Also PDF24 online tools <https://tools.pdf24.org/en/remove-pdf-metadata>
    Also Sejda online tools <https://www.sejda.com/edit-pdf-metadata>

    This is getting long so let's break off a tangent for metadata removal. Suffice to say that removal of metadata is critically important, which
    means it behooves us to find an easy way for everyone to be able to do it.

    My goal, always, is to help everyone with every post, so this post will
    delve into details since I'm all about solving the problem for everyone.

    Let's break into a tangent for the removal of metadata, which is critically important for privacy, where let's just state what can be in PDF metadata.

    Some of the basic information hidden in PDF metadata can be:
    Title: The name of the document.
    Author: The person or entity who created the document.
    Subject: A brief description of the document's content.
    Keywords: Terms that describe the document's content, used for searching.
    Creation Date: The date and time when the PDF was created.
    Modification Date: The date and time when the PDF was last modified.
    Creator: The application used to create the PDF.
    Producer: The application used to convert the document to PDF.

    While more advanced hidden information in PDF metadata might be:
    XMP Metadata: More extensive and customizable metadata
    Rights Management: Information related to copyright and usage permissions.
    Security Settings: Details about encryption and access restrictions.
    Accessibility Metadata: Assistive technologies to help interpret the PDF.
    Embedded File Metadata: Embedded files (like images) have metadata too!

    What irks me on metadata is that in my Adobe Acrobat version 6 payware, I
    can't remove my PC username in the PDF metadata, which stinks for privacy.

    However, what also is an issue are online metadata-removal tools, where,
    well, having grown up during the Cold War where I had to duck and cover, I
    am leery of anything online, especially an online privacy protection tool.

    So if I discount the following online tools for removal of metadata
    [x]PDFgear online tools <https://www.pdfgear.com/>
    [x]PDF24 online tools <https://tools.pdf24.org/en/remove-pdf-metadata>
    [x]Sejda online tools <https://www.sejda.com/edit-pdf-metadata>

    That's leaves us, currently, with the following for metadata removal:
    LibreOffice <https://www.libreoffice.org/download/download-libreoffice/>
    PDFGear <https://www.pdfgear.com/pdfgear-for-windows/>

    I don't need to delve into LibreOffice for most of the people here.
    So let's take up how PDFGear does the removal of PDF metadata.
    <https://downloadfiles.pdfgear.com/releases/windows/pdfgear_setup_v2.1.12.exe>
    Name: pdfgear_setup_v2.1.12.exe
    Size: 136412680 bytes (130 MiB)
    SHA256: C8A19A4A06FB8D28812916FF1735CD4DC0F82BF16FBC5100BBEB71A44F32CCF9
    Defaults to: C:\Program Files\PDFgear

    Upon launching, PDFGear phones home via the default browser (e.g., TOR):
    <https://www.pdfgear.com/congrats/?action=install>

    Wow. I mean wow. This is kind of like Calibre, upon first inspection.
    Just wow. It does a lot. Pretty much PDF to anything (e.g., PDF to Word).
    [Of course, with varying levels of "doing something" but that's for later.]

    Let's just figure out how to use PDFGear, offline, for metadata removal.
    This is getting long in the tooth, so let's close this article with that.

    --- SoupGate-Win32 v1.05
    * Origin: fsxNet Usenet Gateway (21:1/5)
  • From Marion@21:1/5 to Marion on Sat Mar 1 07:21:46 2025
    XPost: alt.comp.os.windows-11, comp.editors, comp.text.pdf

    On Sat, 1 Mar 2025 06:49:51 -0000 (UTC), Marion wrote :


    Let's just figure out how to use PDFGear, offline, for metadata removal.

    Given everything I do is intended also for the benefit of everyone else,
    first I need a sample PDF that has metadata that everyone can access.

    Let's try this:
    <https://www.hekatron.de/fileadmin/user_upload/testfolder/Sample.pdf>
    Name: Sample.pdf
    Size: 37545 bytes (36 KiB)
    SHA256: 2534C7C146709BD2881BF2A791F44A3EFC31A7230EA0D400AA17E3E8FE5DE279

    Good. In Adobe Acrobat 6 payware, there is metadata that we can test.
    Acrobat6:File > Document Properties > Description >
    Title: PDF Metadata Sample
    Author: Nigel Maddocks
    Subject: Test Document
    Created: 8/21/2015 1:42:21 AM <== harder to remove than you'd think
    Modified: 8/21/2015 1:45:31 AM
    Application: Acrobat PDFMaker 15 for Word <== harder to remove also
    etc.

    Since not everyone has Acrobat payware, I'll describe how to look at the metadata using PDFGear (so that everyone benefits from every action).

    Looking at the UserGuide, this appears to be the procedure (simplified):
    <https://www.pdfgear.com/windows-user-guide/introduction-pdfgear.htm>

    a. C:\Program Files\PDFgear\PDFLauncher.exe
    b. PDFgear:Open File > Sample.pdf
    c. PDFgear:Help > Document Properties
    d. You can manually remove some, but not all the document properties
    [x]Title
    [x]Author
    [x]Subject
    [x]Keywords
    [_]Creator
    [_]Producer
    [_]Created
    [_]Modified
    etc.
    e. Open in another tool to check if the metadata was removed.

    Given this worked (for some degree of "working") to remove the most
    egregious metadata, can I declare this a success for the team?
    [x] Metadata removal (LibreOffice Writer, PDFGear offline)

    --- SoupGate-Win32 v1.05
    * Origin: fsxNet Usenet Gateway (21:1/5)
  • From Lawrence D'Oliveiro@21:1/5 to Marion on Sat Mar 1 20:17:30 2025
    XPost: alt.comp.os.windows-11, comp.editors, comp.text.pdf

    On Sat, 1 Mar 2025 06:24:35 -0000 (UTC), Marion wrote:

    Thanks for the suggestion of PikePDF, which I was wholly unaware of,
    since the list was taken from discussions on the windows newsgroups over time.

    Yes, there is a difference in mentality between a gaggle of users
    accustomed to isolated, monolithic applications versus one based on a cooperating ecosystem of interlocking toolkits.

    Here’s another PDF toolkit: Poppler. This is a more extensive one, that covers both the creation and rendering of PDF files. For example, Inkscape relies on Poppler when you ask it to import pages from a PDF file into
    your illustration.

    --- SoupGate-Win32 v1.05
    * Origin: fsxNet Usenet Gateway (21:1/5)
  • From Lawrence D'Oliveiro@21:1/5 to Marion on Sat Mar 1 20:23:08 2025
    XPost: alt.comp.os.windows-11, comp.editors, comp.text.pdf

    On Sat, 1 Mar 2025 07:21:46 -0000 (UTC), Marion wrote:

    Let's try this:
    <https://www.hekatron.de/fileadmin/user_upload/testfolder/Sample.pdf>

    acrid showinfo Sample.pdf
    {
    "/ModDate": "D:20150821094531+01'00'",
    "/Subject": "Test Document",
    "/Title": "PDF Metadata Sample",
    "/Comments": "",
    "/Author": "Nigel Maddocks",
    "/CreationDate": "D:20150821094221+01'00'",
    "/Keywords": "12345678",
    "/Producer": "Adobe PDF Library 15.0",
    "/Creator": "Acrobat PDFMaker 15 for Word",
    "/Company": "",
    "/SourceModified": "D:20150821084155",
    "Metadata": «see below»
    }

    acrid getxmp Sample.pdf
    <?xpacket begin="" id="W5M0MpCehiHzreSzNTczkc9d"?>
    <x:xmpmeta xmlns:x="adobe:ns:meta/" x:xmptk="Adobe XMP Core 5.6-c015 81.157285, 2014/12/12-00:43:15 ">
    <rdf:RDF xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#">
    <rdf:Description rdf:about=""
    xmlns:xmp="http://ns.adobe.com/xap/1.0/"
    xmlns:xmpMM="http://ns.adobe.com/xap/1.0/mm/"
    xmlns:dc="http://purl.org/dc/elements/1.1/"
    xmlns:pdf="http://ns.adobe.com/pdf/1.3/"
    xmlns:pdfx="http://ns.adobe.com/pdfx/1.3/">
    <xmp:ModifyDate>2015-08-21T09:45:31+01:00</xmp:ModifyDate>
    <xmp:CreateDate>2015-08-21T09:42:21+01:00</xmp:CreateDate>
    <xmp:MetadataDate>2015-08-21T09:45:31+01:00</xmp:MetadataDate>
    <xmp:CreatorTool>Acrobat PDFMaker 15 for Word</xmp:CreatorTool>
    <xmpMM:DocumentID>uuid:32f55d7c-7ef1-46ca-85c0-b5b91509ff82</xmpMM:DocumentID>
    <xmpMM:InstanceID>uuid:4da38cf4-2b42-417c-b34e-f529c34ac6cc</xmpMM:InstanceID>
    <xmpMM:subject>
    <rdf:Seq>
    <rdf:li>1</rdf:li>
    </rdf:Seq>
    </xmpMM:subject>
    <dc:format>application/pdf</dc:format>
    <dc:title>
    <rdf:Alt>
    <rdf:li xml:lang="x-default">PDF Metadata Sample</rdf:li>
    </rdf:Alt>
    </dc:title>
    <dc:description>
    <rdf:Alt>
    <rdf:li xml:lang="x-default">Test Document</rdf:li>
    </rdf:Alt>
    </dc:description>
    <dc:creator>
    <rdf:Seq>
    <rdf:li>Nigel Maddocks</rdf:li>
    </rdf:Seq>
    </dc:creator>
    <dc:subject>
    <rdf:Bag>
    <rdf:li>12345678</rdf:li>
    </rdf:Bag>
    </dc:subject>
    <pdf:Producer>Adobe PDF Library 15.0</pdf:Producer>
    <pdf:Keywords>12345678</pdf:Keywords>
    <pdfx:SourceModified>D:20150821084155</pdfx:SourceModified>
    <pdfx:Company/>
    <pdfx:Comments/>
    </rdf:Description>
    </rdf:RDF>
    </x:xmpmeta>





















    <?xpacket end="w"?>"

    --- SoupGate-Win32 v1.05
    * Origin: fsxNet Usenet Gateway (21:1/5)
  • From Marion@21:1/5 to Lawrence D'Oliveiro on Sun Mar 2 03:02:52 2025
    XPost: alt.comp.os.windows-11, comp.editors, comp.text.pdf

    On Sat, 1 Mar 2025 20:17:30 -0000 (UTC), Lawrence D'Oliveiro wrote :


    Here's another PDF toolkit: Poppler. This is a more extensive one, that covers both the creation and rendering of PDF files. For example, Inkscape relies on Poppler when you ask it to import pages from a PDF file into
    your illustration.

    You know your PDF tools for sure!

    Apparently Poppler is a free and open-source software command-line tool
    library used more on Linux than on Windows, who naming was inspired by
    <https://en.wikipedia.org/wiki/The_Problem_with_Popplers>

    But there are also Windows binaries, apparently.
    <https://poppler.freedesktop.org/>
    <https://github.com/oschwartz10612/poppler-windows>
    <https://github.com/oschwartz10612/poppler-windows/releases/tag/v24.08.0-0>
    Name: Release-24.08.0-0.zip
    Size: 15090263 bytes (14 MiB)
    SHA256: 58A6F9AE269756231D2F9AA6CBA39D75FEC6DEACAF3C4A50683383B5F3D5A527

    The poppler-utils are used to
    [x]Extract text from PDFs (pdftotext).
    [x]Extract images from PDFs (pdfimages).
    [x]Convert PDFs to other formats (pdftoppm, pdftocairo, pdftohtml).
    [x]Get information about PDFs (pdfinfo).
    [x]Merge and separate pdf files.

    To test, I downloaded it and extracted it to Windows to see these:
    pdfattach.exe: Embeds files into a PDF as attachments.
    pdfdetach.exe: Extracts embedded files (attachments) from a PDF.
    pdffonts.exe: Lists the fonts used in a PDF.
    pdfimages.exe: Extracts images from a PDF.
    pdfinfo.exe: Displays information about a PDF.
    pdfseparate.exe: Separates a PDF into individual pages.
    pdftocairo.exe: Converts PDFs to PNG, JPEG, etc. using Cairo graphics
    pdftohtml.exe: Converts PDFs to HTML.
    pdftoppm.exe: Converts PDFs to PPM/PGM/PBM image formats.
    pdftops.exe: Converts PDFs to PostScript.
    pdftotext.exe: Extracts text from a PDF.
    pdfunite.exe: Merges multiple PDFs into one.
    zstd.exe: Compresses or decompresses (requires Zstandard)

    Let's just try "pdfinfo" on the sample PDF we previously downloaded:
    <https://www.hekatron.de/fileadmin/user_upload/testfolder/Sample.pdf>

    pdfinfo Sample.pdf
    Title: PDF Metadata Sample
    Subject: Test Document
    Keywords: 12345678
    Author: Nigel Maddocks
    Creator: Acrobat PDFMaker 15 for Word
    Producer: Adobe PDF Library 15.0
    CreationDate: Fri Aug 21 02:42:21 2015 Mountain Daylight Time
    ModDate: Fri Aug 21 02:45:31 2015 Mountain Daylight Time
    Custom Metadata: yes
    Metadata Stream: yes
    Tagged: yes
    UserProperties: no
    Suspects: no
    Form: none
    JavaScript: no
    Pages: 1
    Encrypted: no
    Page size: 595.32 x 841.92 pts (A4)
    Page rot: 0
    File size: 37545 bytes
    Optimized: yes
    PDF version: 1.5

    The sample is too simple to extract images or detach files included with
    the PDF, but I'm sure those other functions work, so it's a nice addition.

    Seems like a nice tool. Too bad it doesn't remove metadata as it would be
    nice to run a find for all pdf files and to strip out the metadata in them.

    --- SoupGate-Win32 v1.05
    * Origin: fsxNet Usenet Gateway (21:1/5)
  • From Marion@21:1/5 to Lawrence D'Oliveiro on Sun Mar 2 03:18:18 2025
    XPost: alt.comp.os.windows-11, comp.editors, comp.text.pdf

    On Sat, 1 Mar 2025 20:23:08 -0000 (UTC), Lawrence D'Oliveiro wrote :


    Let's try this:
    <https://www.hekatron.de/fileadmin/user_upload/testfolder/Sample.pdf>

    acrid showinfo Sample.pdf

    Definitely works nicely. I've added the suggestions that I think made sense
    as a general use chart for Windows users, whose current version is below.

    [?] Print book format PDF (FinePrint payware)
    [x] Add or concatenate pages (pdftk, acrobat payware)
    [x] Add signature (Adobe Reader Fill-and-sign sign-yourself tool)
    [x] Archive sites (wkhtmltopdf, Acrobat payware,fastone scroll capture)
    [x] Compress PDFs (ImageMagick, PDFgear, rlvision)
    [x] Convert PDF to MSOffice (PDFgear, Calibre for MS Word only)
    [x] Convert PDF to MSWord (Calibre, PDFgear)
    [x] Convert PDF to epub format (Calibre)
    [x] Convert PDF to PostScript (Calibre, Poppler)
    [x] Converts PDFs to HTML (poppler)
    [x] Converts PDFs to PNG, JPEG, etc (poppler) using Cairo graphics
    [x] Converts PDFs to PPM/PGM/PBM image formats (poppler)
    [x] Create PDF new text (Irfanview or Paint.NET plugins + Ghostscript)
    [x] Edit PDF existing text (Adobe Reader commenting, Acrobat payware)
    [x] Embeds files into a PDF as attachments (poppler)
    [x] Extract images (PDF Exchange Viewer, PDF Shaper, PDFgear, poppler)
    [x] Extract text (poppler)
    [x] Extracts embedded files (attachments) from a PDF (poppler)
    [x] Fastest PDF readers (Sumatra or Foxit)
    [x] Globally search & replace PDF text (Libre Office)
    [x] List fonts used in a PDF (poppler)
    [x] Merge PDFs (pdfsam, pdftk, PDFgear, Poppler)
    [x] Metadata display on command line (poppler)
    [x] Metadata removal (LibreOffice Writer, PDFgear offline)
    [x] OCR, PDF-Xchange, freeOCR (paperfile.net), GOCR (jocr.sourceforge.net)
    [x] Offline encrypt PDF with a password (pdfencrypt)
    [x] Online shrink PDF
    https://www.adobe.com/acrobat/online/compress-pdf.html
    [x] PDF text to audio file (Balabolka)
    [x] Remove pages (pdfsam, pdftk)
    [x] Remove restrictions (Ghostscript,Ghostview,ps2edit,pdfwrite,pdf2djvu)
    [x] Renumber pages (Acrobat Reader)
    [x] Reorder pages (mutool)
    [x] Rotate pages (Acrobat Reader)
    [x] Separates a PDF into individual pages (Poppler)
    [x] Split PDFs (PDFgear, Poppler)
    [x] Tile PDFs (i.e., to print large posters) (Posterazor)
    [?] What other tasks do you do to edit or modify a PDF file?

    I'm sure we're missing more important functionality than we have,
    but so far I think this takes into effect the suggestions to date.

    --- SoupGate-Win32 v1.05
    * Origin: fsxNet Usenet Gateway (21:1/5)
  • From Lawrence D'Oliveiro@21:1/5 to All on Sun Mar 2 22:03:12 2025
    XPost: alt.comp.os.windows-11, comp.editors, comp.text.pdf

    Perhaps another one I should mention is PDFMiner. This is a bit of a
    specialist one, focused on extracting text items from a PDF page, and
    using various heuristics to try to reassemble them into larger text
    blocks.

    --- SoupGate-Win32 v1.05
    * Origin: fsxNet Usenet Gateway (21:1/5)
  • From Marion@21:1/5 to Lawrence D'Oliveiro on Mon Mar 3 03:17:31 2025
    XPost: alt.comp.os.windows-11, comp.editors, comp.text.pdf

    On Sun, 2 Mar 2025 22:03:12 -0000 (UTC), Lawrence D'Oliveiro wrote :


    Perhaps another one I should mention is PDFMiner. This is a bit of a specialist one, focused on extracting text items from a PDF page, and
    using various heuristics to try to reassemble them into larger text
    blocks.

    Thank you for adding value to the spirit of this conversation where the PDF experts and editing experts are involved, along with the Windows users.

    Looking up what the PDFMiner Python tool can do for us, it's important to
    note it's apparently designed for extracting information from PDF files.

    I'm not quite sure how PDFMinor differs from any of the other text
    extractors (e.g., PDF to TEXT) but it seems to gather layout data also.

    While it can extract metadata, it seems to me it's mostly used to "mine"
    large assemblages of PDF files for textual data of interest to the user.

    The original PDFMiner has apparently been forked as pdfminer.six, which,
    as far as I can tell from date stamps, is still actively being updated.
    <https://github.com/euske/pdfminer>
    <https://github.com/pdfminer/pdfminer.six>

    Since it functions on windows, (within the python enviroment) and since it
    does something useful (mine text in PDFs), I'll add it to the PDF chart as
    [x] Extract text (poppler) or mine textual & metadata (pdfminersix)

    Here's the current chart, where I simply ask for more things done to PDFs.
    [?] Print book format PDF (FinePrint payware)
    [x] Add or concatenate pages (pdftk, acrobat payware)
    [x] Add signature (Adobe Reader Fill-and-sign sign-yourself tool)
    [x] Archive sites (wkhtmltopdf, Acrobat payware,fastone scroll capture)
    [x] Compress PDFs (ImageMagick, PDFgear, rlvision)
    [x] Convert PDF to MSOffice (PDFgear, Calibre for MS Word only)
    [x] Convert PDF to MSWord (Calibre, PDFgear)
    [x] Convert PDF to epub format (Calibre)
    [x] Convert PDF to PostScript (Calibre, Poppler)
    [x] Converts PDFs to HTML (poppler)
    [x] Converts PDFs to PNG, JPEG, etc (poppler) using Cairo graphics
    [x] Converts PDFs to PPM/PGM/PBM image formats (poppler)
    [x] Create PDF new text (Irfanview or Paint.NET plugins + Ghostscript)
    [x] Edit PDF existing text (Adobe Reader commenting, Acrobat payware)
    [x] Embeds files into a PDF as attachments (poppler)
    [x] Extract images (PDF Exchange Viewer, PDF Shaper, PDFgear, poppler)
    [x] Extract text (poppler) or mine textual & metadata (pdfminersix)
    [x] Extracts embedded files (attachments) from a PDF (poppler)
    [x] Fastest PDF readers (Sumatra or Foxit)
    [x] Globally search & replace PDF text (Libre Office)
    [x] List fonts used in a PDF (poppler)
    [x] Merge PDFs (pdfsam, pdftk, PDFgear, Poppler)
    [x] Metadata display on command line (poppler)
    [x] Metadata removal (LibreOffice Writer, PDFgear offline)
    [x] OCR, PDF-Xchange, freeOCR (paperfile.net), GOCR (jocr.sourceforge.net)
    [x] Offline encrypt PDF with a password (pdfencrypt)
    [x] Online shrink PDF <adobe.com/acrobat/online/compress-pdf.html>
    [x] PDF text to audio file (Balabolka)
    [x] Remove pages (pdfsam, pdftk)
    [x] Remove restrictions (Ghostscript,Ghostview,ps2edit,pdfwrite,pdf2djvu)
    [x] Renumber pages (Acrobat Reader)
    [x] Reorder pages (mutool)
    [x] Rotate pages (Acrobat Reader)
    [x] Separates a PDF into individual pages (poppler)
    [x] Split PDFs (PDFgear, Poppler)
    [x] Tile PDFs (i.e., to print large posters) (Posterazor)
    [?] What other tasks do you do to edit or modify a PDF file?

    --- SoupGate-Win32 v1.05
    * Origin: fsxNet Usenet Gateway (21:1/5)
  • From G@21:1/5 to Marion on Mon Mar 3 09:19:35 2025
    XPost: alt.comp.os.windows-11, comp.editors, comp.text.pdf

    In comp.editors Marion <marion@facts.com> wrote:
    On Sun, 2 Mar 2025 22:03:12 -0000 (UTC), Lawrence D'Oliveiro wrote :

    Perhaps another one I should mention is PDFMiner. This is a bit of a
    specialist one, focused on extracting text items from a PDF page, and
    using various heuristics to try to reassemble them into larger text
    blocks.

    Thank you for adding value to the spirit of this conversation where the PDF experts and editing experts are involved, along with the Windows users.

    Looking up what the PDFMiner Python tool can do for us, it's important to note it's apparently designed for extracting information from PDF files.

    I'm not quite sure how PDFMinor differs from any of the other text
    extractors (e.g., PDF to TEXT) but it seems to gather layout data also.

    While it can extract metadata, it seems to me it's mostly used to "mine" large assemblages of PDF files for textual data of interest to the user.

    The original PDFMiner has apparently been forked as pdfminer.six, which,
    as far as I can tell from date stamps, is still actively being updated. <https://github.com/euske/pdfminer> <https://github.com/pdfminer/pdfminer.six>

    Since it functions on windows, (within the python enviroment) and since it does something useful (mine text in PDFs), I'll add it to the PDF chart as [x] Extract text (poppler) or mine textual & metadata (pdfminersix)

    Here's the current chart, where I simply ask for more things done to PDFs. [?] Print book format PDF (FinePrint payware)
    [x] Add or concatenate pages (pdftk, acrobat payware)
    [x] Add signature (Adobe Reader Fill-and-sign sign-yourself tool)
    [x] Archive sites (wkhtmltopdf, Acrobat payware,fastone scroll capture)
    [x] Compress PDFs (ImageMagick, PDFgear, rlvision)
    [x] Convert PDF to MSOffice (PDFgear, Calibre for MS Word only)
    [x] Convert PDF to MSWord (Calibre, PDFgear)
    [x] Convert PDF to epub format (Calibre)
    [x] Convert PDF to PostScript (Calibre, Poppler)
    [x] Converts PDFs to HTML (poppler)
    [x] Converts PDFs to PNG, JPEG, etc (poppler) using Cairo graphics
    [x] Converts PDFs to PPM/PGM/PBM image formats (poppler)
    [x] Create PDF new text (Irfanview or Paint.NET plugins + Ghostscript)
    [x] Edit PDF existing text (Adobe Reader commenting, Acrobat payware)
    [x] Embeds files into a PDF as attachments (poppler)
    [x] Extract images (PDF Exchange Viewer, PDF Shaper, PDFgear, poppler)
    [x] Extract text (poppler) or mine textual & metadata (pdfminersix)
    [x] Extracts embedded files (attachments) from a PDF (poppler)
    [x] Fastest PDF readers (Sumatra or Foxit)
    [x] Globally search & replace PDF text (Libre Office)
    [x] List fonts used in a PDF (poppler)
    [x] Merge PDFs (pdfsam, pdftk, PDFgear, Poppler)
    [x] Metadata display on command line (poppler)
    [x] Metadata removal (LibreOffice Writer, PDFgear offline)
    [x] OCR, PDF-Xchange, freeOCR (paperfile.net), GOCR (jocr.sourceforge.net) [x] Offline encrypt PDF with a password (pdfencrypt)
    [x] Online shrink PDF <adobe.com/acrobat/online/compress-pdf.html>
    [x] PDF text to audio file (Balabolka)
    [x] Remove pages (pdfsam, pdftk)
    [x] Remove restrictions (Ghostscript,Ghostview,ps2edit,pdfwrite,pdf2djvu)
    [x] Renumber pages (Acrobat Reader)
    [x] Reorder pages (mutool)
    [x] Rotate pages (Acrobat Reader)
    [x] Separates a PDF into individual pages (poppler)
    [x] Split PDFs (PDFgear, Poppler)
    [x] Tile PDFs (i.e., to print large posters) (Posterazor)
    [?] What other tasks do you do to edit or modify a PDF file?

    I suppose all are available for Windows, it would be useful to know which are also for Linux or Mac.

    G

    --- SoupGate-Win32 v1.05
    * Origin: fsxNet Usenet Gateway (21:1/5)
  • From Marion@21:1/5 to All on Mon Mar 3 17:01:20 2025
    XPost: alt.comp.os.windows-11, comp.editors, comp.text.pdf

    On 3 Mar 2025 09:19:35 GMT, G wrote :


    I suppose all are available for Windows, it would be useful to know which are also for Linux or Mac.

    Here's the Windows software, so far, that I've tested for PDF manipulation.
    <https://i.postimg.cc/jSNb7bkF/pspdf.jpg>

    However, your point about testing Linux/Mac software is well taken, given
    the c.e and c.t.p newsgroups will have folks on the Linux & Mac newsgroups.

    My history is that I cut my teeth on IBM assembly, cobol, fortran 77, etc.,
    so I grew up on PDP11, DEC/VMS, SunOS, Solaris, etc., well before my first
    real Linux (Redhat) & then Centos & Ubuntu, so I agree Linux is important.

    And, in the Silicon Valley corporate atmosphere, long ago we all used the
    Mac before the dual boot Windows/Redhat became our standard desktop PC.

    Since I always download and test (almost) every suggested pgm, what's
    needed for me to add Linux would be to dual boot to test these apps out.

    For a long while I dual booted to Ubuntu in the Unity days and as an
    indirect result, I personally abandoned the dual boot before Unity
    (thankfully) died (although I had switched the desktop by then to KDE).

    The point of that history being that while I *agree* Linux is important,
    I'm not going to be who affirms what works and what doesn't, for Linux.

    Likewise, while I have probably more Apple mobile devices than most people, being a substitute teacher has taught me that the Mac isn't designed with anywhere near, oh, shall we say "freedoms", as anything that I'm used to.

    So, while both the Mac & Linux are widely available for PDF manipulation,
    I'm not going to be the one to flesh out what works & what doesn't on 'em.

    But someone else can take up the banner and run with it, as this is Usenet. Here's a "dir /b" of my pspdf archive on Windows for software to consider:
    acrobat
    bullzip
    calibre
    cutepdf
    fileoptimizer
    fineprint
    foxit
    ghoststuff
    msoffice_save_as_pdf
    mupdf
    ocr
    pdf-xchange_viewer
    pdf2office
    pdf_text_to_audio
    pdfcreator
    pdfencrypt
    pdfgear
    pdfminersix
    pdfsam
    pdfshaper
    pdftk
    pdfxchange
    pdfxv
    poppler
    posterazor
    psutils
    sumatra
    wkhtmltox
    wps_pdf2word
    xpdf

    --- SoupGate-Win32 v1.05
    * Origin: fsxNet Usenet Gateway (21:1/5)
  • From G@21:1/5 to Marion on Mon Mar 3 19:08:42 2025
    XPost: alt.comp.os.windows-11, comp.editors, comp.text.pdf

    In comp.editors Marion <marion@facts.com> wrote:

    My history is that I cut my teeth on IBM assembly, cobol, fortran 77, etc., so I grew up on PDP11, DEC/VMS, SunOS, Solaris, etc., well before my first real Linux (Redhat) & then Centos & Ubuntu, so I agree Linux is important.

    I wrote my first program on punch card... for a IBM (maybe a 340?) and the lab had half the punchers(?) from honeywell, which, of course had different character set, Fun!.
    And then the usual as you but avoided SunOs and SOlaris and went with Xenix
    and HP-UX and of course CP/M and DOS. I installed my first Linux (Yggdrasil,
    15 or so Floppy) on a Compaq Presario(I think) with a Massive 10MB HD!
    Than I bought a RedHat box and kept with it, so Fedora Core and Fedora now (KDE). I haven't used Windows for work for almost 30 years, so no ulcers.

    [snip]

    Thanks for your work here...

    G

    --- SoupGate-Win32 v1.05
    * Origin: fsxNet Usenet Gateway (21:1/5)
  • From Peter Flynn@21:1/5 to Marion on Mon Mar 3 21:38:56 2025
    XPost: alt.comp.os.windows-11, comp.editors, comp.text.pdf

    On 02/03/2025 03:18, Marion wrote:
    On Sat, 1 Mar 2025 20:23:08 -0000 (UTC), Lawrence D'Oliveiro wrote :

    Let's try this:
     <https://www.hekatron.de/fileadmin/user_upload/testfolder/Sample.pdf>

    acrid showinfo Sample.pdf

    Definitely works nicely. I've added the suggestions that I think made sense as a general use chart for Windows users, whose current version is below.

    [snip]

    Thank you, this is a hugely useful list.

    The one tool missing seems to be LaTeX, for creating PDFs, but perhaps
    "create" in this context means "convert from some other typeset format"
    rather than "typeset directly to PDF" (in industry terms, "originate")

    Peter

    --- SoupGate-Win32 v1.05
    * Origin: fsxNet Usenet Gateway (21:1/5)
  • From Lawrence D'Oliveiro@21:1/5 to Peter Flynn on Mon Mar 3 23:35:24 2025
    XPost: alt.comp.os.windows-11, comp.editors, comp.text.pdf

    On Mon, 3 Mar 2025 21:38:56 +0000, Peter Flynn wrote:

    The one tool missing seems to be LaTeX, for creating PDFs, but perhaps "create" in this context means "convert from some other typeset format" rather than "typeset directly to PDF" (in industry terms, "originate")

    Along those lines, is it also worth mentioning that the Cairo graphics
    library includes the option for rendering drawing to PDF, among its range
    of output surface types?

    <https://www.cairographics.org/manual/cairo-PDF-Surfaces.html>

    --- SoupGate-Win32 v1.05
    * Origin: fsxNet Usenet Gateway (21:1/5)
  • From Lawrence D'Oliveiro@21:1/5 to Marion on Tue Mar 4 03:13:01 2025
    XPost: alt.comp.os.windows-11, comp.editors, comp.text.pdf

    On Tue, 4 Mar 2025 02:35:38 -0000 (UTC), Marion wrote:

    Different from both Inkscape (vector) & ImageMagick (raster) is
    Ghostscript, which can rasterize PDFs ...

    Yes. It is essentially a full-function PostScript interpreter, it just has
    lots of options for the output format. Rasterizing PDF was probably a relatively minor function to add on top of that.

    --- SoupGate-Win32 v1.05
    * Origin: fsxNet Usenet Gateway (21:1/5)
  • From Marion@21:1/5 to Lawrence D'Oliveiro on Tue Mar 4 02:35:38 2025
    XPost: alt.comp.os.windows-11, comp.editors, comp.text.pdf

    On Mon, 3 Mar 2025 23:35:24 -0000 (UTC), Lawrence D'Oliveiro wrote :


    The one tool missing seems to be LaTeX, for creating PDFs, but perhaps
    "create" in this context means "convert from some other typeset format"
    rather than "typeset directly to PDF" (in industry terms, "originate")

    Along those lines, is it also worth mentioning that the Cairo graphics library includes the option for rendering drawing to PDF, among its range
    of output surface types?

    <https://www.cairographics.org/manual/cairo-PDF-Surfaces.html>

    Regarding...
    x] Converts PDFs to PNG, JPEG, etc (poppler) using Cairo graphics
    Fist, I have to make two confessions to maintain my credibility.

    The first is: I cheat.

    I always use the ancient payware Adobe Acrobat version 6 whenever I need to convert a PDF into image formats - so I really don't know much about the concept of converting a PDF to raster or vector graphics images.

    The second confession is that I don't really know what other people want to
    do when they "say" they want to convert a PDF into an image format.

    But when I researched your poppler suggestion, I saw it did that well.

    I had never heard of Cairo until I dug into your suggestion of using it. Apparently poppler uses either the "Splash" or the "Cairo" graphics libs to translate PDF instructions into graphical drawing commands.

    Apparently Cairo drawing commands (i.e., pdftocairo) render the PDF onto something called a "surface" which itself can be either a raster image
    (PNG, JPEG, etc.) or a vector surface (DXF, SVG, etc.) for high quality.

    Since I use the payware, I don't know what other solutions exist, so I dug
    into the concept a bit to find that if you feed Inkscape a PDF, it attempts
    to interpret the vector elements (lines, shapes, text) to make them
    editable (which could be powerful for those changing PDF images).

    In addition, it seems Inkscape can trace a bitmap to vectorize raster
    images within a PDF into vector paths - which is useful for scaling.

    On the other hand, I found in my searches today that ImageMagick is
    apparently very good at converting PDFs into raster image formats
    (like PNG, JPEG, PPM & TIFF).

    Different from both Inkscape (vector) & ImageMagick (raster) is
    Ghostscript, which can rasterize PDFs, but using a command line (unless you combine it with GhostView) where Ghostscript splits & merges PDFs also.

    Having confused myself with what I said above, I should ask what people are trying to do with the image when they want to convert PDF to an image.

    And do they want vector images out of the PDF. Or raster?
    [x] Convert PDF to raster (Imagemagick,GhostScript,Poppler-pdftocairo)
    [x] Convert PDF to vector (Inkscape, Poppler-pdftocairo)

    Anything else (which is free to use)?

    --- SoupGate-Win32 v1.05
    * Origin: fsxNet Usenet Gateway (21:1/5)
  • From Marion@21:1/5 to Peter Flynn on Tue Mar 4 03:31:03 2025
    XPost: alt.comp.os.windows-11, comp.editors, comp.text.pdf

    On Mon, 3 Mar 2025 21:38:56 +0000, Peter Flynn wrote :


    Thank you, this is a hugely useful list.'

    Thanks. It needs updating but it covers most of what I've needed to do.
    The hard part is keeping it to a single line per need, which necessitates excessive shortening of the descriptions.

    I think you're talking about this line item below:
    [x] Create PDF new text (Irfanview or Paint.NET plugins + Ghostscript)
    Where that's really about ADDING text to an existing PDF document.

    Given the confusion inherent in the lousy way I wrote it, I'll change it to
    [x] Add text to existing pdf (Irfanview or Paint.NET plugins + Ghostscript)

    Moving forward on your point below with LaTeX, I agree with you that we
    need a line item for creating PDFs from scratch using a markup language.

    The one tool missing seems to be LaTeX, for creating PDFs, but perhaps "create" in this context means "convert from some other typeset format" rather than "typeset directly to PDF" (in industry terms, "originate")

    Given LaTeX is the de facto standard for creating mathematical and
    scientific documents, I agree with you that it belongs as a line item.
    [x] Generate complex PDF using markup language (LaTeX via pdfTeX or LuaTeX)

    When I researched what else that is no cost which generates PDFs, most were programming libraries, such as ReportLab, PDFKit, jsPDF & PDFSharp.

    Contrasting with those programming libraries (which require programming
    code), LaTeX is a markup language and typesetting system. We write the document's content and structure using LaTeX commands, and LaTeX handles
    the visual formatting to PDF.

    So I won't include the programming libraries in that new line, for now.
    Does that clarify the two lines better for 'creating' & 'generating' PDF?

    [x] Add text to existing pdf (Irfanview or Paint.NET plugins + Ghostscript)
    [x] Generate complex PDF using markup language (LaTeX via pdfTeX or LuaTeX)

    --- SoupGate-Win32 v1.05
    * Origin: fsxNet Usenet Gateway (21:1/5)
  • From Marion@21:1/5 to All on Tue Mar 4 03:52:13 2025
    XPost: alt.comp.os.windows-11, comp.editors, comp.text.pdf

    On 3 Mar 2025 19:08:42 GMT, G wrote :


    I wrote my first program on punch card... for a IBM (maybe a 340?) and the lab
    had half the punchers(?) from honeywell, which, of course had different character set, Fun!.

    Yeah. You reminded me of the computer rooms with the raised floors (for the A/C) and the big magtapes (I still have one somewhere). When I wrote my
    first program in school, it was in punched cards and Fortran 77 on an IBM,
    oh, maybe an 1130?

    I remember the punched tape machine sat there, unused, while we employed
    the "more modern" boxes of punched cards. Do I remember correctly that a
    box was about two thousand lines of Fortran code? Most of my code was about
    a quarter to, at most, a half a box, so that's probably 500 to 1000 lines, excluding the obligatory IBM JCL.

    At some point (late seventies?) I was writing in PL/1 and about a decade
    later in hex (when I wire wrapped my own Motorola 68701 microcontrollers).

    At some point in time I swore off programming languages after concluding
    that they all do teh same damn thing, only with different syntax. :)

    I got sick of the syntax requirements the older I got. :)
    Now the only thing I 'program' in is the Windows command line. :)

    That's why when Lawrence mentioned Python, I shuddered. I still have
    nightmares about having to look up "Error 52" or something like that in the
    IBM 1130 documentation. Kids nowadays have no idea how that used to be! :)

    --- SoupGate-Win32 v1.05
    * Origin: fsxNet Usenet Gateway (21:1/5)
  • From Lawrence D'Oliveiro@21:1/5 to Marion on Tue Mar 4 04:41:22 2025
    XPost: alt.comp.os.windows-11, comp.editors, comp.text.pdf

    On Tue, 4 Mar 2025 03:52:13 -0000 (UTC), Marion wrote:

    At some point in time I swore off programming languages after concluding
    that they all do teh same damn thing, only with different syntax. :)

    New ways of viewing the programming problem often lead to major
    improvements in programmer productivity.

    Remember Brooks’ Law (or one of them): once a code base reaches a certain size, a skilled programmer that is familiar with it is only able to
    contribute about 10 lines of suitably-debugged code per day. And that
    applies across a wide range of language abstraction levels, from assembler
    all the way up to what he called “metaprogramming” languages, or very- high-level languages. Consider how little 10 lines of assembler can do,
    versus 10 lines of shell script or Python or Lisp code.

    So the only way to improve programmer productivity is to move to higher-
    and higher-level languages.

    --- SoupGate-Win32 v1.05
    * Origin: fsxNet Usenet Gateway (21:1/5)
  • From Anton Shepelev@21:1/5 to All on Tue Mar 4 19:13:33 2025
    XPost: alt.comp.os.windows-11, comp.editors, comp.text.pdf

    Marion:

    It has been a few years... I think collectively we need to
    update this chart of the single best freeware for the
    stated PDF editing needs...

    [?] Print book format PDF (FinePrint payware)

    I believe LaTeX has packages for that. I have produced PDF
    booklets from Postscrpt, with psbook and psnup. The
    incoming PostScipt was mine, from either LaTeX or GNU Troff.

    [x] Convert PDF to MSWord or any epub format & vice versa (Calibre)
    [x] Edit PDF existing text (Adobe Reader commenting, Acrobat payware)
    [x] Globally search & replace PDF text (Libre Office)

    PDF is meant to be a final format not meant for editing.
    Keep it so.

    [x] Extract images (PDF Exchange Viewer, PDF Shaper)

    SumatraPDF

    [x] Reorder pages (mutool)
    [x] Rotate pages (Acrobat Reader)

    pdftk of course.

    What are your suggestions (so that everyone benefits from
    your knowledge)?

    The obvious one -- typsetting software for producing PDFs
    from text, e.g.: LaTeX, (GNU) Troff.

    --
    () ascii ribbon campaign -- against html e-mail
    /\ www.asciiribbon.org -- against proprietary attachments

    --- SoupGate-Win32 v1.05
    * Origin: fsxNet Usenet Gateway (21:1/5)
  • From Tim Slattery@21:1/5 to Marion on Tue Mar 4 10:23:06 2025
    XPost: alt.comp.os.windows-11, comp.editors, comp.text.pdf

    Marion <marion@facts.com> wrote:

    On 3 Mar 2025 19:08:42 GMT, G wrote :


    I wrote my first program on punch card... for a IBM (maybe a 340?) and the lab
    had half the punchers(?) from honeywell, which, of course had different
    character set, Fun!.

    Yeah. You reminded me of the computer rooms with the raised floors (for the >A/C) and the big magtapes (I still have one somewhere). When I wrote my
    first program in school, it was in punched cards and Fortran 77 on an IBM, >oh, maybe an 1130?

    For me it was an IBM 1620. This was the late '60s at Palo Alto High
    School. The machine belonged to the school district. The school
    district office was right next to the school, so we got to use their
    machine.

    I remember the punched tape machine sat there, unused, while we employed
    the "more modern" boxes of punched cards. Do I remember correctly that a
    box was about two thousand lines of Fortran code? Most of my code was about
    a quarter to, at most, a half a box, so that's probably 500 to 1000 lines, >excluding the obligatory IBM JCL.

    A box of Hollerith (or IBM) cards held 2,000 cards. Each FORTRAN
    statement would go on a separate card, so 2,000 FORTRAN statements is
    right. And if you dropped your box and spilled your cards, good luck
    getting them back in the correct order! BTW: the 1620 was pre-JCL.

    --
    Tim Slattery
    timslattery <at> utexas <dot> edu

    --- SoupGate-Win32 v1.05
    * Origin: fsxNet Usenet Gateway (21:1/5)
  • From Carlos E.R.@21:1/5 to Marion on Tue Mar 4 20:18:43 2025
    XPost: alt.comp.os.windows-11, comp.editors, comp.text.pdf

    On 2025-03-04 04:31, Marion wrote:
    Given LaTeX is the de facto standard for creating mathematical and
    scientific documents, I agree with you that it belongs as a line item.
    [x] Generate complex PDF using markup language (LaTeX via pdfTeX or LuaTeX)

    LyX is a visual editor (WYSIWYM, "what you see is what you mean"
    approach), that can generate PDFs and other formats. It is related to
    LaTeX but is not the same. Although it probably uses external libraries
    to do the actual conversion.

    Libre Office can also generate PDFs, including cryptographically signed documents, probably using libraries. It can also edit PDFs.

    --
    Cheers, Carlos.

    --- SoupGate-Win32 v1.05
    * Origin: fsxNet Usenet Gateway (21:1/5)
  • From Peter Flynn@21:1/5 to Tim Slattery on Tue Mar 4 22:39:00 2025
    XPost: alt.comp.os.windows-11, comp.editors, comp.text.pdf

    On 04/03/2025 15:23, Tim Slattery wrote:
    [snip]
    A box of Hollerith (or IBM) cards held 2,000 cards. Each FORTRAN
    statement would go on a separate card, so 2,000 FORTRAN statements
    is right. And if you dropped your box and spilled your cards, good
    luck getting them back in the correct order!

    In my college, the computing centre had a card sorter, which was huge
    and stood on cast-iron lion's feet which someone had painted gold :-)
    But of course it only worked if your program statements or data lines
    (cards) were numbered. You only drop a box of cards once.

    (You may hear the voice of experience there :-)

    Peter

    --- SoupGate-Win32 v1.05
    * Origin: fsxNet Usenet Gateway (21:1/5)
  • From Peter Flynn@21:1/5 to Marion on Tue Mar 4 22:32:26 2025
    XPost: alt.comp.os.windows-11, comp.editors, comp.text.pdf

    On 04/03/2025 03:31, Marion wrote:
    On Mon, 3 Mar 2025 21:38:56 +0000, Peter Flynn wrote :
    [snip]
    Given LaTeX is the de facto standard for creating mathematical and
    scientific documents, I agree with you that it belongs as a line item.
    [...]> Contrasting with those programming libraries (which require
    programming
    code), LaTeX is a markup language and typesetting system. We write the document's content and structure using LaTeX commands, and LaTeX handles
    the visual formatting to PDF.

    There are two routes to PDF if you have XML documents (increasingly
    common; and both Word and Libre Office are XML inside). Both use
    Extensible Stylesheet Language (XSL) but in different ways

    But these may be well outside the scope of your list as they are
    two-stage processes.

    • XSL-FO uses XSL to describe the transformation to Formatting Objects
    (FO) and an FO processor converts that to PDF

    • XSLT uses XSL to describe the transformation to any text format,
    including LaTeX, which can then produce PDF.

    XSL-FO is no longer being developed by the W3C; however both methods are
    in common use in publishing.

    So I won't include the programming libraries in that new line, for now.
    Does that clarify the two lines better for 'creating' & 'generating' PDF?

    [x] Add text to existing pdf (Irfanview or Paint.NET plugins + Ghostscript) [x] Generate complex PDF using markup language (LaTeX via pdfTeX or LuaTeX)

    Yes, that looks fine, thanks.

    Peter

    --- SoupGate-Win32 v1.05
    * Origin: fsxNet Usenet Gateway (21:1/5)
  • From Peter Flynn@21:1/5 to Anton Shepelev on Tue Mar 4 22:43:11 2025
    XPost: alt.comp.os.windows-11, comp.editors, comp.text.pdf

    On 04/03/2025 16:13, Anton Shepelev wrote:
    [snip]
    PDF is meant to be a final format not meant for editing.
    Keep it so.

    Nevertheless, I have several times been able to make on-the-fly changes
    and even introduce additional material like running headers or
    paper-type change statements with

    pdf2ps file.pdf | sed -e "<stuff>" | ps2pdf newfile.pdf

    Peter

    --- SoupGate-Win32 v1.05
    * Origin: fsxNet Usenet Gateway (21:1/5)
  • From Wolf Greenblatt@21:1/5 to Marion on Tue Mar 4 19:11:08 2025
    XPost: alt.comp.os.windows-11, comp.editors, comp.text.pdf

    On Tue, 4 Mar 2025 23:58:24 -0000 (UTC), Marion wrote:

    Can someone with Python installed test it out on a sample PDF for us?

    I don't have Python but these are sample PDFs to run tests upon. <https://sample-files.com/documents/pdf/> <https://www.learningcontainer.com/sample-pdf-files-for-testing/> <https://examplefile.com/document/pdf> <https://getsamplefiles.com/sample-document-files/pdf> <https://www.graydart.com/sample/documents/pdf> <https://freetestdata.com/document-files/pdf/> <https://onlinetestcase.com/pdf-file/>

    --- SoupGate-Win32 v1.05
    * Origin: fsxNet Usenet Gateway (21:1/5)
  • From Lawrence D'Oliveiro@21:1/5 to Peter Flynn on Tue Mar 4 23:32:00 2025
    XPost: alt.comp.os.windows-11, comp.editors, comp.text.pdf

    On Tue, 4 Mar 2025 22:32:26 +0000, Peter Flynn wrote:

    There are two routes to PDF if you have XML documents (increasingly
    common; and both Word and Libre Office are XML inside). Both use
    Extensible Stylesheet Language (XSL) but in different ways

    Does anybody still use SGML? Remember, that gave birth to HTML.

    --- SoupGate-Win32 v1.05
    * Origin: fsxNet Usenet Gateway (21:1/5)
  • From Lawrence D'Oliveiro@21:1/5 to Peter Flynn on Tue Mar 4 23:34:14 2025
    XPost: alt.comp.os.windows-11, comp.editors, comp.text.pdf

    On Tue, 4 Mar 2025 22:43:11 +0000, Peter Flynn wrote:

    On 04/03/2025 16:13, Anton Shepelev wrote:

    PDF is meant to be a final format not meant for editing.
    Keep it so.

    Nevertheless, I have several times been able to make on-the-fly changes
    and even introduce additional material like running headers or
    paper-type change statements with

    pdf2ps file.pdf | sed -e "<stuff>" | ps2pdf newfile.pdf

    Remember that a PDF file is not required to contain anything resembling readable text strings. If present, these are commonly stored separately
    from the rendered glyphs you seen on-screen, in a form designed for
    searching, together with an indication of which part of the page to
    highlight to represent a selection of that text. But this is all optional.

    --- SoupGate-Win32 v1.05
    * Origin: fsxNet Usenet Gateway (21:1/5)
  • From Marion@21:1/5 to Anton Shepelev on Tue Mar 4 23:58:24 2025
    XPost: alt.comp.os.windows-11, comp.editors, comp.text.pdf

    On Tue, 4 Mar 2025 19:13:33 +0300, Anton Shepelev wrote :


    [?] Print book format PDF (FinePrint payware)

    I believe LaTeX has packages for that. I have produced PDF
    booklets from Postscrpt, with psbook and psnup. The
    incoming PostScipt was mine, from either LaTeX or GNU Troff.

    Thanks for that suggestion as, in the past, I printed booklets.

    Printing a booklet requires arranging both sides of the pages in a specific order so that when the 8.5x11-inch printed sheets are folded in half, the
    pages appear in the correct sequence as if they were in a booklet.

    [x] Convert PDF to MSWord or any epub format & vice versa (Calibre)
    [x] Edit PDF existing text (Adobe Reader commenting, Acrobat payware)
    [x] Globally search & replace PDF text (Libre Office)

    PDF is meant to be a final format not meant for editing.
    Keep it so.

    Understood. But sometimes you want to make minor edits when all you have is
    the PDF and not the original document. This happens a lot, it turns out.

    However, back to the printing of booklets, that's one thing I had trouble finding free (as in no cost) software as printing a booklet from folded 8.5x11-inch paper is more complex than standard printing, especially when dealing with double-sided printing and odd numbers of pages & title pages.

    I'm aware of "pdfbook", but, alas, that requires Python (aurgh!, again!)
    on Windows, but luckily, pdfbook should be easier to use on Linux & Mac.
    <https://pdfbook.readthedocs.io/en/latest/Installation.html>

    Unfortunately, the "examples" provided are, um, shall we say underwhelming?
    <https://pdfbook.readthedocs.io/en/latest/Examples.html>

    Digging a bit, I think something like this pdfbook command may work:
    pdfbook input.pdf --paper letter --outfile output-booklet.pdf

    Supposedly that pdfbook command will consider the number of pages in the input.pdf to then automatically order the pages so that when folded, the
    pages are in the correct order.

    Can someone with Python installed test it out on a sample PDF for us?

    [x] Extract images (PDF Exchange Viewer, PDF Shaper)

    SumatraPDF

    Thanks for that suggestion. Googling it, apparently SumatraPDF can
    *manually* copy an image which you can then paste into an image editor.

    It turns out, I think, based on what I found anyway, that SumatraPDF uses
    an underlying MuPDF library to extract images, so as a result of your
    advice, I'll add muPDF to the line for extracting images.

    While I was looking that up, I found that the free (no cost) PDF-XChange
    Editor also can extract images from a PDF, so I'll add that too.

    I think I'm going to have to give up on keeping it one line per item.
    [x] Extract images (PDF Exchange Editor, PDF Shaper, PDFgear, poppler, muPDF)

    [x] Reorder pages (mutool)
    [x] Rotate pages (Acrobat Reader)

    pdftk of course.

    Thanks for that suggestion. Checking rotate first, it seems that the
    pdftoolkit rotation of 180 degrees is a great suggestion. Much appreciated.
    pdftk input.pdf cat 1-endsouth output output.pdf

    Looking that up, I found that mutool can also rotate, e.g., for 180 degrees
    mutool convert -R 180 input.pdf output.pdf

    I found out in that search that the GUIs for PDF-XChange Editor (free) and
    PDF Arranger (free) can also rotate pages and save to a new PDF file.

    Apparently Acrobat READER can only rotate the view, but it can't SAVE the rotated results, so I'll remove Acrobat Reader from that rotation line.
    [x] Rotate pages (pdftk, mutool, PDF-XChange Editor, PDF Arranger)

    Now looking at the reordering of pages (which is really a duplicate of inserting and deleting pages, isn't it?) the same programs can re-order
    pages, but (as above) the latter two do it graphically, and pdftk is better
    at it than mutool is, but all of them can reorder pages nonetheless.

    For example, to flip the order of page 2 and 3 in a pdf using pdftk:
    pdftk input.pdf cat 1 3 2 4-end output output.pdf
    But it turned out to be difficult with mutool (possible but difficult).
    So I removed muTool because it's just too complicated to reorder with it.
    [x] Reorder pages (pdftk, PDF-XChange Editor, PDF Arranger)

    Thanks for pointing out the omissions.

    What are your suggestions (so that everyone benefits from
    your knowledge)?

    The obvious one -- typsetting software for producing PDFs
    from text, e.g.: LaTeX, (GNU) Troff.

    LaTeX seems to be what we have to fall back on when, for example, pdfbook primarily focuses on the page reordering aspect of booklet creation
    (although I'm confused since I saw mention that pdfbook is in the pdfjam package, which can be installed within a TeX distribution so maybe it can
    all be put together for everyone to easily output booket-style PDFs?).

    As noted, LaTeX has sophisticated built-in features to analyze the content
    of the PDF to more intelligently handle page breaks to avoid splitting
    images or creating an awkward text flow.

    Digging a bit into LaTeX (which I've never used myself), MiKTeX & TeX Live
    seem to be free (no cost) Windows, Linux & Mac "modern" TeX distributions.
    <https://miktex.org/howto/install-miktex>
    <https://math.asu.edu/resources/computer-resources/texlive-windows>

    Also TeXstudio or TeXworks appear to be free (no cost) LaTeX editors.
    <https://www.texstudio.org/>
    <https://www.tug.org/texworks/>

    Since the expensive cost of free (no cost) software is in the trials and tribulations to find the best ones that work, does anyone have experience
    with any of the distributions above for creating the booklet style PDFs?

    --- SoupGate-Win32 v1.05
    * Origin: fsxNet Usenet Gateway (21:1/5)
  • From Don_from_AZ@21:1/5 to Peter Flynn on Tue Mar 4 21:28:23 2025
    XPost: alt.comp.os.windows-11, comp.editors, comp.text.pdf

    Peter Flynn <peter@silmaril.ie> writes:

    On 04/03/2025 15:23, Tim Slattery wrote:
    [snip]
    A box of Hollerith (or IBM) cards held 2,000 cards. Each FORTRAN
    statement would go on a separate card, so 2,000 FORTRAN statements
    is right. And if you dropped your box and spilled your cards, good
    luck getting them back in the correct order!

    In my college, the computing centre had a card sorter, which was huge
    and stood on cast-iron lion's feet which someone had painted gold :-)
    But of course it only worked if your program statements or data lines
    (cards) were numbered. You only drop a box of cards once.

    (You may hear the voice of experience there :-)

    Peter

    I worked for Honeywell at a WWMMCCS site (World-Wide Military Command
    and Control System) at the Washington Navy Yard as a tech support
    guy. This was about 1973. I created new boot decks for the Honeywell 635
    as needed, with new patch cards or new configurations. I was carrying a
    tray of punched cards to the computer room, one hand on each end of the
    card tray. I tried to hook the door handle with my little finger to pull
    it open and lost my grip on the tray; cards all over the floor!
    Embarrasing to say the least, and I didn't even try to put them back in
    order, just punched out a new deck.
    --
    -Don_from_AZ-

    --- SoupGate-Win32 v1.05
    * Origin: fsxNet Usenet Gateway (21:1/5)
  • From Paul@21:1/5 to All on Wed Mar 5 01:39:30 2025
    XPost: alt.comp.os.windows-11, comp.editors, comp.text.pdf

    On Tue, 3/4/2025 11:28 PM, Don_from_AZ wrote:
    Peter Flynn <peter@silmaril.ie> writes:

    On 04/03/2025 15:23, Tim Slattery wrote:
    [snip]
    A box of Hollerith (or IBM) cards held 2,000 cards. Each FORTRAN
    statement would go on a separate card, so 2,000 FORTRAN statements
    is right. And if you dropped your box and spilled your cards, good
    luck getting them back in the correct order!

    In my college, the computing centre had a card sorter, which was huge
    and stood on cast-iron lion's feet which someone had painted gold :-)
    But of course it only worked if your program statements or data lines
    (cards) were numbered. You only drop a box of cards once.

    (You may hear the voice of experience there :-)

    Peter

    I worked for Honeywell at a WWMMCCS site (World-Wide Military Command
    and Control System) at the Washington Navy Yard as a tech support
    guy. This was about 1973. I created new boot decks for the Honeywell 635
    as needed, with new patch cards or new configurations. I was carrying a
    tray of punched cards to the computer room, one hand on each end of the
    card tray. I tried to hook the door handle with my little finger to pull
    it open and lost my grip on the tray; cards all over the floor!
    Embarrasing to say the least, and I didn't even try to put them back in order, just punched out a new deck.


    You could put numbers in column 72.

    https://upload.wikimedia.org/wikipedia/commons/thumb/5/58/FortranCardPROJ039.agr.jpg/960px-FortranCardPROJ039.agr.jpg

    You generally also need some spacing between card numbers, like use 10,20,30 then if you added a card it could be 25, then a card between 20 and 25, could be card 23. You needed a means to support program edits.

    If you needed a program label line number, that went on the left
    of the card, while anti-spill card numbering went on the right of the card. This is an example of an infinite loop, and if I spilled these two cards
    on the floor, 25 comes before 30 and we're all good. The 100 on the left
    is a program label.

    100 GOTO 100 25
    CONTINUE 30

    For small student programs, nobody bothered with column 72. For larger projects, it was considered a necessary evil.

    The easiest way to get the cards numbered, would be to copy the deck to
    the output punch and have column 72 content added automatically. I don't
    think I ever used the output punch on a mainframe. Maybe my QDGS deck
    had column 72, because that was a box-full (2000 cards for some purpose).
    I don't recollect where that deck came from -- maybe someone had it
    punched for me. The output came in a box (the Ice Queen probably put
    the 2000 cards in a box, after they were punched). The ice Queen was
    the emotionless computer operator, she wore a heavy sweater year round,
    because it was like 50F inside the computer room. I'd been on a tour
    of that computer room, and it's like working in the flash freezer
    at the fish plant. It is COLD in there. That's why she was the
    Ice Queen, as she had mastery over ice, and the ice could not get to her. Nothing got to her. I don't think the expression on her face, changed
    even once. She didn't even have a name! That's how emotionless she was.

    Paul

    --- SoupGate-Win32 v1.05
    * Origin: fsxNet Usenet Gateway (21:1/5)
  • From Anton Shepelev@21:1/5 to All on Wed Mar 5 12:50:00 2025
    XPost: alt.comp.os.windows-11, comp.editors, comp.text.pdf

    Marion:

    I believe LaTeX has packages for that. I have produced
    PDF booklets from Postscrpt, with psbook and psnup. The
    incoming PostScipt was mine, from either LaTeX or GNU
    Troff.

    Thanks for that suggestion as, in the past, I printed
    booklets.

    Printing a booklet requires arranging both sides of the
    pages in a specific order so that when the 8.5x11-inch
    printed sheets are folded in half, the pages appear in the
    correct sequence as if they were in a booklet.. [...]
    However, back to the printing of booklets, that's one
    thing I had trouble finding free (as in no cost) software
    as printing a booklet from folded 8.5x11-inch paper is
    more complex than standard printing, especially when
    dealing with double-sided printing and odd numbers of
    pages & title pages.

    That's exactly what I did with psbook and psnup from
    psutils. I produced an A4 booklet from an A5 document with
    sequential pages. I printed my booklent by with a normal
    single-sided printer, in two runs, without reordering the
    sheaf in between. All the rearrangement was taken care of
    during the generation of the PDF.

    Before *roff and *tex, I used to print such booklets in
    whatever software I had at hand, including MS Word '97 and
    Adobe PageMaker. For Word, I had a simple Pascal program
    that would generate two comma-spearated lists of page
    numbers, ready to paste in into the Print window, for
    printing the even and odd pages of the booklet.

    The alrorithm is rather simple, IIRC. After you append
    empty pages to make the total a multiple of four, the
    following invariant holds true for each side of any quatro:

    page_left + page_right = page_total + 1

    For example, a twelve-page booklet will be printed on three
    (12/4) sheets thusly:

    even odd
    12 1 verso 2 11
    10 3 verso 4 9
    8 5 verso 6 7

    So, you first print the odd pages in increasing order, and
    then odd ones in decreasing order, to end up with a set of
    sheats ready to fold (IIRC). I still seem to have the ugly
    ancient program in Pascal that I wrote in late school or
    early University to perfrom that task:

    https://paste.sr.ht/~shepton/4d8374ec6e2c543fa8caad43709596b1cae5cd94

    It should compile in FreePascal compiler.

    As noted, LaTeX has sophisticated built-in features to
    analyze the content of the PDF to more intelligently
    handle page breaks to avoid splitting images or creating
    an awkward text flow.

    No, LaTeX and Troff are tools to author and typeset new
    documents, rather than modify existing PDFs.

    Since the expensive cost of free (no cost) software is in
    the trials and tribulations to find the best ones that
    work,

    Which is why I prefer to use time-honoured classics.

    does anyone have experience with any of the distributions
    above for creating the booklet style PDFs?

    I have used this one a long time ago:

    https://ctan.org/pkg/booklet

    And I have used psutils (with psbook and psnup) no so long
    time ago:

    https://github.com/rrthomas/psutils

    Generally, I have found *roff much easier than LaTeX. I
    have written several Groff macros myself, including those to
    wrap text around images as shown in this newsletter:

    https://corewar.co.uk/coreops/coreops02.txt

    Both Groff and LaTeX have great and helpful communities.

    --
    () ascii ribbon campaign -- against html e-mail
    /\ www.asciiribbon.org -- against proprietary attachments

    --- SoupGate-Win32 v1.05
    * Origin: fsxNet Usenet Gateway (21:1/5)
  • From Daniel70@21:1/5 to Paul on Wed Mar 5 20:48:45 2025
    XPost: alt.comp.os.windows-11, comp.editors, comp.text.pdf

    On 5/03/2025 5:39 pm, Paul wrote:
    On Tue, 3/4/2025 11:28 PM, Don_from_AZ wrote:
    Peter Flynn <peter@silmaril.ie> writes:

    On 04/03/2025 15:23, Tim Slattery wrote: [snip]
    A box of Hollerith (or IBM) cards held 2,000 cards. Each
    FORTRAN statement would go on a separate card, so 2,000 FORTRAN
    statements is right. And if you dropped your box and spilled
    your cards, good luck getting them back in the correct order!

    In my college, the computing centre had a card sorter, which was
    huge and stood on cast-iron lion's feet which someone had painted
    gold :-) But of course it only worked if your program statements
    or data lines (cards) were numbered. You only drop a box of cards
    once.

    (You may hear the voice of experience there :-)

    Peter

    I worked for Honeywell at a WWMMCCS site (World-Wide Military
    Command and Control System) at the Washington Navy Yard as a tech
    support guy. This was about 1973. I created new boot decks for the
    Honeywell 635 as needed, with new patch cards or new
    configurations. I was carrying a tray of punched cards to the
    computer room, one hand on each end of the card tray. I tried to
    hook the door handle with my little finger to pull it open and lost
    my grip on the tray; cards all over the floor! Embarrasing to say
    the least, and I didn't even try to put them back in order, just
    punched out a new deck.


    You could put numbers in column 72.

    https://upload.wikimedia.org/wikipedia/commons/thumb/5/58/FortranCardPROJ039.agr.jpg/960px-FortranCardPROJ039.agr.jpg

    You generally also need some spacing between card numbers, like use
    10,20,30 then if you added a card it could be 25, then a card between
    20 and 25, could be card 23. You needed a means to support program
    edits.

    That was the reason stated when I started BASIC Programming (1985'ish
    .... O./K., so I was a late comer!!) "Number the Lines 10, 20, 30, etc,
    so, if you need to add a bit extra, there were all those other line
    numbers to use!!"

    --
    Daniel70

    --- SoupGate-Win32 v1.05
    * Origin: fsxNet Usenet Gateway (21:1/5)
  • From Carlos E.R.@21:1/5 to Paul on Wed Mar 5 13:26:31 2025
    XPost: alt.comp.os.windows-11, comp.editors, comp.text.pdf

    On 2025-03-05 07:39, Paul wrote:
    On Tue, 3/4/2025 11:28 PM, Don_from_AZ wrote:
    Peter Flynn <peter@silmaril.ie> writes:

    On 04/03/2025 15:23, Tim Slattery wrote:
    [snip]
    A box of Hollerith (or IBM) cards held 2,000 cards. Each FORTRAN
    statement would go on a separate card, so 2,000 FORTRAN statements
    is right. And if you dropped your box and spilled your cards, good
    luck getting them back in the correct order!

    In my college, the computing centre had a card sorter, which was huge
    and stood on cast-iron lion's feet which someone had painted gold :-)
    But of course it only worked if your program statements or data lines
    (cards) were numbered. You only drop a box of cards once.

    (You may hear the voice of experience there :-)

    Peter

    I worked for Honeywell at a WWMMCCS site (World-Wide Military Command
    and Control System) at the Washington Navy Yard as a tech support
    guy. This was about 1973. I created new boot decks for the Honeywell 635
    as needed, with new patch cards or new configurations. I was carrying a
    tray of punched cards to the computer room, one hand on each end of the
    card tray. I tried to hook the door handle with my little finger to pull
    it open and lost my grip on the tray; cards all over the floor!
    Embarrasing to say the least, and I didn't even try to put them back in
    order, just punched out a new deck.


    You could put numbers in column 72.

    https://upload.wikimedia.org/wikipedia/commons/thumb/5/58/FortranCardPROJ039.agr.jpg/960px-FortranCardPROJ039.agr.jpg

    I can not read punched cards.

    The lab at uni used them just the year before me. I never had to use
    cards or punched paper tape.

    :-D

    The trouble was, there were not enough terminals for all the students. I
    tried going somewhere where you could rent a computer by the hour.
    Finally I decided I needed a computer of my own, a PC clone. I was
    fortunate to have a family that could afford it.

    --
    Cheers, Carlos.

    --- SoupGate-Win32 v1.05
    * Origin: fsxNet Usenet Gateway (21:1/5)
  • From Paul@21:1/5 to Carlos E.R. on Wed Mar 5 09:04:41 2025
    XPost: alt.comp.os.windows-11, comp.editors, comp.text.pdf

    On Wed, 3/5/2025 7:26 AM, Carlos E.R. wrote:

    I can not read punched cards.

    The lab at uni used them just the year before me. I never had to use cards or punched paper tape.

    :-D

    The trouble was, there were not enough terminals for all the students. I tried going somewhere where you could rent a computer by the hour. Finally I decided I needed a computer of my own, a PC clone. I was fortunate to have a family that could afford
    it.


    My hobby was self-financed as a student,
    so everything had to be "cheep! cheep! cheep!" :-)

    Real computers had to wait until I had a job.

    Paul

    --- SoupGate-Win32 v1.05
    * Origin: fsxNet Usenet Gateway (21:1/5)
  • From Frank Slootweg@21:1/5 to Carlos E.R. on Wed Mar 5 14:39:02 2025
    XPost: alt.comp.os.windows-11, comp.editors, comp.text.pdf

    Carlos E.R. <robin_listas@es.invalid> wrote:
    [...]
    I can not read punched cards.

    The lab at uni used them just the year before me. I never had to use
    cards or punched paper tape.

    :-D

    I had colleagues who could - visually - read ASCII paper tapes. And
    another one could visually read 9-track magtape, when the magnetzation
    was made visible by some kind of fluid.

    [...]

    --- SoupGate-Win32 v1.05
    * Origin: fsxNet Usenet Gateway (21:1/5)
  • From Paul@21:1/5 to Carlos E.R. on Wed Mar 5 09:33:32 2025
    XPost: alt.comp.os.windows-11, comp.editors, comp.text.pdf

    On Wed, 3/5/2025 7:26 AM, Carlos E.R. wrote:


    I can not read punched cards.


    This is a card from a Model 29. It has inked characters along the top,
    so you could read the ASCII character equivalent of the 12 row Hollerith punch.

    http://www.columbia.edu/cu/computinghistory/029-card.jpg

    Not all card schemes, were that friendly.

    You could compare the inked characters on the card, to your 132 column line printer output.

    Paul

    --- SoupGate-Win32 v1.05
    * Origin: fsxNet Usenet Gateway (21:1/5)
  • From Paul@21:1/5 to Frank Slootweg on Wed Mar 5 14:59:06 2025
    XPost: alt.comp.os.windows-11, comp.editors, comp.text.pdf

    On Wed, 3/5/2025 9:39 AM, Frank Slootweg wrote:
    Carlos E.R. <robin_listas@es.invalid> wrote:
    [...]
    I can not read punched cards.

    The lab at uni used them just the year before me. I never had to use
    cards or punched paper tape.

    :-D

    I had colleagues who could - visually - read ASCII paper tapes. And
    another one could visually read 9-track magtape, when the magnetzation
    was made visible by some kind of fluid.

    [...]

    When you use paper tape regularly, you learn how to recognize
    end of record marks. When a tape load reports a read error,
    you can roll back a record and retry, which takes less time
    than loading the paper tape all over again. Learning to do that,
    is a "survival mechanism" :-)

    Paul

    --- SoupGate-Win32 v1.05
    * Origin: fsxNet Usenet Gateway (21:1/5)
  • From Carlos E.R.@21:1/5 to Frank Slootweg on Wed Mar 5 21:12:26 2025
    XPost: alt.comp.os.windows-11, comp.editors, comp.text.pdf

    On 2025-03-05 15:39, Frank Slootweg wrote:
    Carlos E.R. <robin_listas@es.invalid> wrote:
    [...]
    I can not read punched cards.

    The lab at uni used them just the year before me. I never had to use
    cards or punched paper tape.

    :-D

    I had colleagues who could - visually - read ASCII paper tapes. And another one could visually read 9-track magtape, when the magnetzation
    was made visible by some kind of fluid.

    Wow.


    [...]


    --
    Cheers, Carlos.

    --- SoupGate-Win32 v1.05
    * Origin: fsxNet Usenet Gateway (21:1/5)
  • From Carlos E.R.@21:1/5 to Paul on Wed Mar 5 21:11:49 2025
    XPost: alt.comp.os.windows-11, comp.editors, comp.text.pdf

    On 2025-03-05 15:33, Paul wrote:
    On Wed, 3/5/2025 7:26 AM, Carlos E.R. wrote:


    I can not read punched cards.


    This is a card from a Model 29. It has inked characters along the top,
    so you could read the ASCII character equivalent of the 12 row Hollerith punch.

    http://www.columbia.edu/cu/computinghistory/029-card.jpg

    Yes, I have seen those (samples).


    Not all card schemes, were that friendly.

    You could compare the inked characters on the card, to your 132 column line printer output.

    Paul


    --
    Cheers, Carlos.

    --- SoupGate-Win32 v1.05
    * Origin: fsxNet Usenet Gateway (21:1/5)
  • From Zaidy036@21:1/5 to Anton Shepelev on Wed Mar 5 16:50:02 2025
    XPost: alt.comp.os.windows-11, comp.editors, comp.text.pdf

    On 3/5/2025 4:50 AM, Anton Shepelev wrote:
    Marion:

    I believe LaTeX has packages for that. I have produced
    PDF booklets from Postscrpt, with psbook and psnup. The
    incoming PostScipt was mine, from either LaTeX or GNU
    Troff.

    Thanks for that suggestion as, in the past, I printed
    booklets.

    Printing a booklet requires arranging both sides of the
    pages in a specific order so that when the 8.5x11-inch
    printed sheets are folded in half, the pages appear in the
    correct sequence as if they were in a booklet.. [...]
    However, back to the printing of booklets, that's one
    thing I had trouble finding free (as in no cost) software
    as printing a booklet from folded 8.5x11-inch paper is
    more complex than standard printing, especially when
    dealing with double-sided printing and odd numbers of
    pages & title pages.

    That's exactly what I did with psbook and psnup from
    psutils. I produced an A4 booklet from an A5 document with
    sequential pages. I printed my booklent by with a normal
    single-sided printer, in two runs, without reordering the
    sheaf in between. All the rearrangement was taken care of
    during the generation of the PDF.

    Before *roff and *tex, I used to print such booklets in
    whatever software I had at hand, including MS Word '97 and
    Adobe PageMaker. For Word, I had a simple Pascal program
    that would generate two comma-spearated lists of page
    numbers, ready to paste in into the Print window, for
    printing the even and odd pages of the booklet.

    The alrorithm is rather simple, IIRC. After you append
    empty pages to make the total a multiple of four, the
    following invariant holds true for each side of any quatro:

    page_left + page_right = page_total + 1

    For example, a twelve-page booklet will be printed on three
    (12/4) sheets thusly:

    even odd
    12 1 verso 2 11
    10 3 verso 4 9
    8 5 verso 6 7

    So, you first print the odd pages in increasing order, and
    then odd ones in decreasing order, to end up with a set of
    sheats ready to fold (IIRC). I still seem to have the ugly
    ancient program in Pascal that I wrote in late school or
    early University to perfrom that task:

    https://paste.sr.ht/~shepton/4d8374ec6e2c543fa8caad43709596b1cae5cd94

    It should compile in FreePascal compiler.

    As noted, LaTeX has sophisticated built-in features to
    analyze the content of the PDF to more intelligently
    handle page breaks to avoid splitting images or creating
    an awkward text flow.

    No, LaTeX and Troff are tools to author and typeset new
    documents, rather than modify existing PDFs.

    Since the expensive cost of free (no cost) software is in
    the trials and tribulations to find the best ones that
    work,

    Which is why I prefer to use time-honoured classics.

    does anyone have experience with any of the distributions
    above for creating the booklet style PDFs?

    I have used this one a long time ago:

    https://ctan.org/pkg/booklet

    And I have used psutils (with psbook and psnup) no so long
    time ago:

    https://github.com/rrthomas/psutils

    Generally, I have found *roff much easier than LaTeX. I
    have written several Groff macros myself, including those to
    wrap text around images as shown in this newsletter:

    https://corewar.co.uk/coreops/coreops02.txt

    Both Groff and LaTeX have great and helpful communities.

    https://www.bookletcreator.com/how-it-works/

    --- SoupGate-Win32 v1.05
    * Origin: fsxNet Usenet Gateway (21:1/5)
  • From Peter Flynn@21:1/5 to Lawrence D'Oliveiro on Wed Mar 5 22:08:30 2025
    XPost: alt.comp.os.windows-11, comp.editors, comp.text.pdf

    On 04/03/2025 23:32, Lawrence D'Oliveiro wrote:
    On Tue, 4 Mar 2025 22:32:26 +0000, Peter Flynn wrote:

    There are two routes to PDF if you have XML documents (increasingly
    common; and both Word and Libre Office are XML inside). Both use
    Extensible Stylesheet Language (XSL) but in different ways

    Does anybody still use SGML? Remember, that gave birth to HTML.

    And TEI. And DocBook. And XML. And dozens of industrial vocabularies.

    I believe a very small number of projects still use SGML for specialist technical reasons, or possibly tied to obsolete software. Projects I was associated with moved to XML the moment viable processing software
    became available. But SGML still works, and so does the old software.

    Peter

    --- SoupGate-Win32 v1.05
    * Origin: fsxNet Usenet Gateway (21:1/5)
  • From Marion@21:1/5 to Anton Shepelev on Tue Mar 4 23:53:35 2025
    XPost: alt.comp.os.windows-11, comp.editors, comp.text.pdf

    On Tue, 4 Mar 2025 19:13:33 +0300, Anton Shepelev wrote :


    [?] Print book format PDF (FinePrint payware)

    I believe LaTeX has packages for that. I have produced PDF
    booklets from Postscrpt, with psbook and psnup. The
    incoming PostScipt was mine, from either LaTeX or GNU Troff.

    Thanks for that suggestion as, in the past, I printed booklets.

    Printing a booklet requires arranging both sides of the pages in a specific order so that when the 8.5x11-inch printed sheets are folded in half, the
    pages appear in the correct sequence as if they were in a booklet.

    [x] Convert PDF to MSWord or any epub format & vice versa (Calibre)
    [x] Edit PDF existing text (Adobe Reader commenting, Acrobat payware)
    [x] Globally search & replace PDF text (Libre Office)

    PDF is meant to be a final format not meant for editing.
    Keep it so.

    Understood. But sometimes you want to make minor edits when all you have is
    the PDF and not the original document. This happens a lot, it turns out.

    However, back to the printing of booklets, that's one thing I had trouble finding free (as in no cost) software as printing a booklet from folded 8.5x11-inch paper is more complex than standard printing, especially when dealing with double-sided printing and odd numbers of pages & title pages.

    I'm aware of "pdfbook", but, alas, that requires Python (aurgh!, again!)
    on Windows, but luckily, pdfbook should be easier to use on Linux & Mac.
    <https://pdfbook.readthedocs.io/en/latest/Installation.html>

    Unfortunately, the "examples" provided are, um, shall we say underwhelming?
    <https://pdfbook.readthedocs.io/en/latest/Examples.html>

    Digging a bit, I think something like this pdfbook command may work:
    pdfbook input.pdf --paper letter --outfile output-booklet.pdf

    Supposedly that pdfbook command will consider the number of pages in the input.pdf to then automatically order the pages so that when folded, the
    pages are in the correct order.

    Can someone with Python installed test it out on a sample PDF for us?

    [x] Extract images (PDF Exchange Viewer, PDF Shaper)

    SumatraPDF

    Thanks for that suggestion. Googling it, apparently SumatraPDF can
    *manually* copy an image which you can then paste into an image editor.

    It turns out, I think, based on what I found anyway, that SumatraPDF uses
    an underlying MuPDF library to extract images, so as a result of your
    advice, I'll add muPDF to the line for extracting images.

    While I was looking that up, I found that the free (no cost) PDF-XChange
    Editor also can extract images from a PDF, so I'll add that too.

    I think I'm going to have to give up on keeping it one line per item.
    [x] Extract images (PDF Exchange Editor, PDF Shaper, PDFgear, poppler, muPDF)

    [x] Reorder pages (mutool)
    [x] Rotate pages (Acrobat Reader)

    pdftk of course.

    Thanks for that suggestion. Checking rotate first, it seems that the
    pdftoolkit rotation of 180 degrees is a great suggestion. Much appreciated.
    pdftk input.pdf cat 1-endsouth output output.pdf

    Looking that up, I found that mutool can also rotate, e.g., for 180 degrees
    mutool convert -R 180 input.pdf output.pdf

    I found out in that search that the GUIs for PDF-XChange Editor (free) and
    PDF Arranger (free) can also rotate pages and save to a new PDF file.

    Apparently Acrobat READER can only rotate the view, but it can't SAVE the rotated results, so I'll remove Acrobat Reader from that rotation line.
    [x] Rotate pages (pdftk, mutool, PDF-XChange Editor, PDF Arranger)

    Now looking at the reordering of pages (which is really a duplicate of inserting and deleting pages, isn't it?) the same programs can re-order
    pages, but (as above) the latter two do it graphically, and pdftk is better
    at it than mutool is, but all of them can reorder pages nonetheless.

    For example, to flip the order of page 2 and 3 in a pdf using pdftk:
    pdftk input.pdf cat 1 3 2 4-end output output.pdf
    But it turned out to be difficult with mutool (possible but difficult).
    So I removed muTool because it's just too complicated to reorder with it.
    [x] Reorder pages (pdftk, PDF-XChange Editor, PDF Arranger)

    Thanks for pointing out the omissions.

    What are your suggestions (so that everyone benefits from
    your knowledge)?

    The obvious one -- typsetting software for producing PDFs
    from text, e.g.: LaTeX, (GNU) Troff.

    LaTeX seems to be what we have to fall back on when, for example, pdfbook primarily focuses on the page reordering aspect of booklet creation
    (although I'm confused since I saw mention that pdfbook is in the pdfjam package, which can be installed within a TeX distribution so maybe it can
    all be put together for everyone to easily output booket-style PDFs?).

    As noted, LaTeX has sophisticated built-in features to analyze the content
    of the PDF to more intelligently handle page breaks to avoid splitting
    images or creating an awkward text flow.

    Digging a bit into LaTeX (which I've never used myself), MiKTeX & TeX Live
    seem to be free (no cost) Windows, Linux & Mac "modern" TeX distributions.
    <https://miktex.org/howto/install-miktex>
    <https://math.asu.edu/resources/computer-resources/texlive-windows>

    Also TeXstudio or TeXworks appear to be free (no cost) LaTeX editors.
    <https://www.texstudio.org/>
    <https://www.tug.org/texworks/>

    Since the expensive cost of free (no cost) software is in the trials and tribulations to find the best ones that work, does anyone have experience
    with any of the distributions above for creating a booklet style PDF?

    --- SoupGate-Win32 v1.05
    * Origin: fsxNet Usenet Gateway (21:1/5)
  • From Marion@21:1/5 to Paul on Wed Mar 5 21:24:15 2025
    XPost: alt.comp.os.windows-11, comp.editors, comp.text.pdf

    On Wed, 5 Mar 2025 14:59:06 -0500, Paul wrote :


    When you use paper tape regularly, you learn how to recognize
    end of record marks. When a tape load reports a read error,
    you can roll back a record and retry, which takes less time
    than loading the paper tape all over again. Learning to do that,
    is a "survival mechanism" :-)

    Speaking of such survival mechanisms, when I was burning EPROMs for the Motorola 68701 (probably mid 80's time frame) that I wire wrapped myself, I would write down the Assembly Language instructions, at first, on my own.

    Then, after a while, it was a "survival mechanism" to just use the hex
    instead, as what's the difference between "load accumulator A" (LDAA) and
    the hex (86) or for extended addressing, (B6) for the same command.

    It's not that big of a stretch to remember 86 versus LDAA, and it helps a
    lot when it came to burning the EEPROMs (which the 68701 had internally).

    Those days are over and gone, never to return.
    I won't need that EEPROM burner any more than a dwellmeter & timing light.

    --- SoupGate-Win32 v1.05
    * Origin: fsxNet Usenet Gateway (21:1/5)
  • From Marion@21:1/5 to Anton Shepelev on Tue Mar 4 23:24:35 2025
    XPost: alt.comp.os.windows-11, comp.editors, comp.text.pdf

    On Tue, 4 Mar 2025 19:13:33 +0300, Anton Shepelev wrote :


    [?] Print book format PDF (FinePrint payware)

    I believe LaTeX has packages for that. I have produced PDF
    booklets from Postscrpt, with psbook and psnup. The
    incoming PostScipt was mine, from either LaTeX or GNU Troff.

    Thanks for that suggestion as, in the past, I printed booklets.

    Printing a booklet requires arranging both sides of the pages in a specific order so that when the 8.5x11-inch printed sheets are folded in half, the
    pages appear in the correct sequence as if they were in a booklet.

    [x] Convert PDF to MSWord or any epub format & vice versa (Calibre)
    [x] Edit PDF existing text (Adobe Reader commenting, Acrobat payware)
    [x] Globally search & replace PDF text (Libre Office)

    PDF is meant to be a final format not meant for editing.
    Keep it so.

    Understood. But sometimes you want to make minor edits when all you have is
    the PDF and not the original document. This happens a lot, it turns out.

    However, back to the printing of booklets, that's one thing I had trouble finding free (as in no cost) software as printing a booklet from folded 8.5x11-inch paper is more complex than standard printing, especially when dealing with double-sided printing and odd numbers of pages & title pages.

    I'm aware of "pdfbook", but, alas, that requires Python (aurgh!, again!)
    on Windows, but luckily, pdfbook should be easier to use on Linux & Mac.
    <https://pdfbook.readthedocs.io/en/latest/Installation.html>

    Unfortunately, the "examples" provided are, um, shall we say underwhelming?
    <https://pdfbook.readthedocs.io/en/latest/Examples.html>

    Digging a bit, I think something like this pdfbook command may work:
    pdfbook input.pdf --paper letter --outfile output-booklet.pdf

    Supposedly that pdfbook command will consider the number of pages in the input.pdf to then automatically order the pages so that when folded, the
    pages are in the correct order.

    Can someone with Python installed test it out on a sample PDF for us?

    [x] Extract images (PDF Exchange Viewer, PDF Shaper)

    SumatraPDF

    Thanks for that suggestion. Googling it, apparently SumatraPDF can
    *manually* copy an image which you can then paste into an image editor.

    It turns out, I think, based on what I found anyway, that SumatraPDF uses
    an underlying MuPDF library to extract images, so as a result of your
    advice, I'll add muPDF to the line for extracting images.

    While I was looking that up, I found that the free (no cost) PDF-XChange
    Editor also can extract images from a PDF, so I'll add that too.

    I think I'm going to have to give up on keeping it one line per item.
    [x] Extract images (PDF Exchange Editor, PDF Shaper, PDFgear, poppler, muPDF)

    [x] Reorder pages (mutool)
    [x] Rotate pages (Acrobat Reader)

    pdftk of course.

    Thanks for that suggestion. Checking rotate first, it seems that the
    pdftoolkit rotation of 180 degrees is a great suggestion. Much appreciated.
    pdftk input.pdf cat 1-endsouth output output.pdf

    Looking that up, I found that mutool can also rotate, e.g., for 180 degrees
    mutool convert -R 180 input.pdf output.pdf

    I found out in that search that the GUIs for PDF-XChange Editor (free) and
    PDF Arranger (free) can also rotate pages and save to a new PDF file.

    Apparently Acrobat READER can only rotate the view, but it can't SAVE the rotated results, so I'll remove Acrobat Reader from that rotation line.
    [x] Rotate pages (pdftk, mutool, PDF-XChange Editor, PDF Arranger)

    Now looking at the reordering of pages (which is really a duplicate of inserting and deleting pages, isn't it?) the same programs can re-order
    pages, but (as above) the latter two do it graphically, and pdftk is better
    at it than mutool is, but all of them can reorder pages nonetheless.

    For example, to flip the order of page 2 and 3 in a pdf using pdftk:
    pdftk input.pdf cat 1 3 2 4-end output output.pdf
    But it turned out to be difficult with mutool (possible but difficult).
    So I removed muTool because it's just too complicated to reorder with it.
    [x] Reorder pages (pdftk, PDF-XChange Editor, PDF Arranger)

    Thanks for pointing out the omissions.

    What are your suggestions (so that everyone benefits from
    your knowledge)?

    The obvious one -- typsetting software for producing PDFs
    from text, e.g.: LaTeX, (GNU) Troff.

    LaTeX seems to be what we have to fall back on when, for example, pdfbook primarily focuses on the page reordering aspect of booklet creation
    (although I'm confused since I saw mention that pdfbook is in the pdfjam package, which can be installed within a TeX distribution so maybe it can
    all be put together for everyone to easily output booket-style PDFs?).

    As noted, LaTeX has sophisticated built-in features to analyze the content
    of the PDF to more intelligently handle page breaks to avoid splitting
    images or creating an awkward text flow.

    Digging a bit into LaTeX (which I've never used myself), MiKTeX & TeX Live
    seem to be free (no cost) Windows, Linux & Mac "modern" TeX distributions.
    <https://miktex.org/howto/install-miktex>
    <https://math.asu.edu/resources/computer-resources/texlive-windows>

    Also TeXstudio or TeXworks appear to be free (no cost) LaTeX editors.
    <https://www.texstudio.org/>
    <https://www.tug.org/texworks/>

    Since the expensive cost of free (no cost) software is in the trials and tribulations to find the best ones that work, does anyone have experience
    with any of the distributions above for creating booklet style PDFs?

    --- SoupGate-Win32 v1.05
    * Origin: fsxNet Usenet Gateway (21:1/5)
  • From Peter Flynn@21:1/5 to Marion on Wed Mar 5 22:39:31 2025
    XPost: alt.comp.os.windows-11, comp.editors, comp.text.pdf

    On 04/03/2025 23:24, Marion wrote:
    [...]
    Printing a booklet requires arranging both sides of the pages in a
    specific order so that when the 8.5x11-inch printed sheets are
    folded in half, the pages appear in the correct sequence as if they
    were in a booklet.

    This is called "imposition". Printing a book means arranging the pages (usually) 16 (possibly 32) per side of a very large sheet (and therefore
    16 the other side) making a "signature", laid out so that when folded
    and folded and folded etc and trimmed, page 1 has page 2 on the back of
    it; then repeat for the next 32 (or 64) pages, and repeat, etc until all
    X00 pages are accounted for. Printed off a reel of paper (confusingly
    called a web) nowadays, and slit to sheets before folding. Then stacked together, the spines abraded and glued (or sewn with thread for fancy
    books), then draw on the cover (printed separately on board), glue it,
    and give it a final trim.

    However, back to the printing of booklets, that's one thing I had trouble finding free (as in no cost) software as printing a booklet from folded 8.5x11-inch paper is more complex than standard printing, especially when dealing with double-sided printing and odd numbers of pages & title pages.

    And, domestically, having it all set up and running, and then the
    cheapass paper-handling mechanism in the printer feeds two sheets
    instead of one, and messes it all up.

    I'm aware of "pdfbook", but, alas, that requires Python (aurgh!, again!)
    on Windows, but luckily, pdfbook should be easier to use on Linux & Mac.

    I don't think anyone doing this seriously would consider Windows at all.
    There is a massive collection of free text-manipulation tools known collectively as "the Unix text tools" which work on Linux (including Mac
    OSX) but which cause endless compilation trouble on Windows.

    Digging a bit into LaTeX (which I've never used myself), MiKTeX & TeX Live seem to be free (no cost) Windows, Linux & Mac "modern" TeX distributions.

    Correct. The canonical location is the TeX Users Group site (tug.org)

    Also TeXstudio or TeXworks appear to be free (no cost) LaTeX editors.

    Both are excellent but there are lots of others, including (of course)
    Emacs. https://latex.silmaril.ie/formattinginformation/editdis.html

    Since the expensive cost of free (no cost) software is in the trials and tribulations to find the best ones that work, does anyone have experience with any of the distributions above for creating booklet style PDFs?

    My typesetting business has used LaTeX and the Unix text tools for the
    last 30 years (mainly for books) without significant problems¹. There
    are some technical aspects, such as ensuring that the L–R adjustment of
    the text area on one page will correctly occupy the exact same space on
    the back of the next page when printed and bound, but this really only
    affects very large signatures where the thickness of the paper has to be
    taken into account when folder 3–4 times; this is not really relevant
    for booklets. Otherwise it's just a matter of using the right imposition
    scheme and the right page-rearrangement software to implement it.

    Peter

    ----------
    ¹ The real problems are in the copyediting and proofreading of the text,
    and that's common to all systems. Maybe marginally easier in LaTeX
    because the master source is plain text, but the real difficulties come
    in dealing with semi-literate authors and technically ill-informed editors.

    --- SoupGate-Win32 v1.05
    * Origin: fsxNet Usenet Gateway (21:1/5)
  • From Lawrence D'Oliveiro@21:1/5 to All on Thu Mar 6 02:18:53 2025
    XPost: alt.comp.os.windows-11, comp.editors, comp.text.pdf

    On Wed, 5 Mar 2025 20:48:45 +1100, Daniel70 wrote:

    That was the reason stated when I started BASIC Programming (1985'ish
    .... O./K., so I was a late comer!!) "Number the Lines 10, 20, 30, etc,
    so, if you need to add a bit extra, there were all those other line
    numbers to use!!"

    Some of us went by hundreds.

    BASIC was designed as an integral part of an interactive timeshared system
    for students and staff to use at Dartmouth. Line numbers served two
    purposes: one as target labels for GOTOs, the other for ordering lines in
    the editor.

    Both uses are now obsolete.

    By the way, did you have some kind of line-renumbering utility in your
    BASIC system? That would also fix up GOTOs to correctly branch to the renumbered lines.

    --- SoupGate-Win32 v1.05
    * Origin: fsxNet Usenet Gateway (21:1/5)
  • From Paul@21:1/5 to Marion on Thu Mar 6 00:45:27 2025
    XPost: alt.comp.os.windows-11, comp.editors, comp.text.pdf

    On Wed, 3/5/2025 4:24 PM, Marion wrote:
    On Wed, 5 Mar 2025 14:59:06 -0500, Paul wrote :


    When you use paper tape regularly, you learn how to recognize
    end of record marks. When a tape load reports a read error,
    you can roll back a record and retry, which takes less time
    than loading the paper tape all over again. Learning to do that,
    is a "survival mechanism" :-)

    Speaking of such survival mechanisms, when I was burning EPROMs for the Motorola 68701 (probably mid 80's time frame) that I wire wrapped myself, I would write down the Assembly Language instructions, at first, on my own.

    Then, after a while, it was a "survival mechanism" to just use the hex instead, as what's the difference between "load accumulator A" (LDAA) and
    the hex (86) or for extended addressing, (B6) for the same command.

    It's not that big of a stretch to remember 86 versus LDAA, and it helps a
    lot when it came to burning the EEPROMs (which the 68701 had internally).

    Those days are over and gone, never to return.
    I won't need that EEPROM burner any more than a dwellmeter & timing light.

    There are still EEPROMs. They're too convenient to throw away.

    For example, you can store the program code for an FPGA
    (Field Programmable Gate Array) inside an EEPROM.

    Since they keep finding new uses for them, and they
    keep going into different shaped packages, you'll never
    really be rid of them.

    Paul

    --- SoupGate-Win32 v1.05
    * Origin: fsxNet Usenet Gateway (21:1/5)
  • From Daniel70@21:1/5 to Lawrence D'Oliveiro on Thu Mar 6 18:12:41 2025
    XPost: alt.comp.os.windows-11, comp.editors, comp.text.pdf

    On 6/03/2025 1:18 pm, Lawrence D'Oliveiro wrote:
    On Wed, 5 Mar 2025 20:48:45 +1100, Daniel70 wrote:

    That was the reason stated when I started BASIC Programming
    (1985'ish .... O./K., so I was a late comer!!) "Number the Lines
    10, 20, 30, etc, so, if you need to add a bit extra, there were all
    those other line numbers to use!!"

    Some of us went by hundreds.

    BASIC was designed as an integral part of an interactive timeshared
    system for students and staff to use at Dartmouth. Line numbers
    served two purposes: one as target labels for GOTOs, the other for
    ordering lines in the editor.

    Hmm!! Probably O.T. but the 'Dartmouth' you mention .... was that
    Dartmouth, Victoria, Australia or was that somewhere in U.K.?? Or
    elsewhere??

    Both uses are now obsolete.

    By the way, did you have some kind of line-renumbering utility in
    your BASIC system? That would also fix up GOTOs to correctly branch
    to the renumbered lines.

    "line-renumbering utility"?? One's mind, maybe. ;-P
    --
    Daniel70

    --- SoupGate-Win32 v1.05
    * Origin: fsxNet Usenet Gateway (21:1/5)
  • From Frank Slootweg@21:1/5 to Lawrence D'Oliveiro on Thu Mar 6 10:45:37 2025
    XPost: alt.comp.os.windows-11, comp.editors, comp.text.pdf

    Lawrence D'Oliveiro <ldo@nz.invalid> wrote:
    On Wed, 5 Mar 2025 20:48:45 +1100, Daniel70 wrote:

    That was the reason stated when I started BASIC Programming (1985'ish
    .... O./K., so I was a late comer!!) "Number the Lines 10, 20, 30, etc,
    so, if you need to add a bit extra, there were all those other line
    numbers to use!!"

    Some of us went by hundreds.

    BASIC was designed as an integral part of an interactive timeshared system for students and staff to use at Dartmouth. Line numbers served two
    purposes: one as target labels for GOTOs, the other for ordering lines in
    the editor.

    Both uses are now obsolete.

    Was that by any chance one of the HP 2000 Series Timesharing Systems?

    I supported those in the early 70s, mainly at the technical
    'hogescholen' (one level below university) in The Netherlands.

    By the way, did you have some kind of line-renumbering utility in your
    BASIC system? That would also fix up GOTOs to correctly branch to the renumbered lines.

    Wasn't 'renumber' (or 'ren'?) just a command? Or is my memory playing
    tricks with me?

    --- SoupGate-Win32 v1.05
    * Origin: fsxNet Usenet Gateway (21:1/5)
  • From Frank Slootweg@21:1/5 to Paul on Thu Mar 6 10:54:32 2025
    XPost: alt.comp.os.windows-11, comp.editors, comp.text.pdf

    Paul <nospam@needed.invalid> wrote:
    On Wed, 3/5/2025 9:39 AM, Frank Slootweg wrote:
    Carlos E.R. <robin_listas@es.invalid> wrote:
    [...]
    I can not read punched cards.

    The lab at uni used them just the year before me. I never had to use
    cards or punched paper tape.

    :-D

    I had colleagues who could - visually - read ASCII paper tapes. And another one could visually read 9-track magtape, when the magnetzation
    was made visible by some kind of fluid.

    [...]

    When you use paper tape regularly, you learn how to recognize
    end of record marks. When a tape load reports a read error,
    you can roll back a record and retry, which takes less time
    than loading the paper tape all over again. Learning to do that,
    is a "survival mechanism" :-)

    Indeed! When we were 'generating' (installing) an HP RTE (Real Time Executive) system, we needed to load dozens of paper tapes, each of
    which came in a about 10x10cm box, so rather long papertapes. When you
    got a 'checksum error', which you were nearly guaranteed to get, you
    didn't want to start all over again.

    The system was installed *to* disk, but the (binaries) input came from
    paper tape. Later we got disk drives with a fixed disk and a removable diskpack, so we got 'Grandfather' diskpacks to generate a system,
    instead of using papertape. Bliss!

    --- SoupGate-Win32 v1.05
    * Origin: fsxNet Usenet Gateway (21:1/5)
  • From Carlos E.R.@21:1/5 to Frank Slootweg on Thu Mar 6 12:50:59 2025
    XPost: alt.comp.os.windows-11, comp.editors, comp.text.pdf

    On 2025-03-06 11:45, Frank Slootweg wrote:
    Lawrence D'Oliveiro <ldo@nz.invalid> wrote:
    On Wed, 5 Mar 2025 20:48:45 +1100, Daniel70 wrote:


    By the way, did you have some kind of line-renumbering utility in your
    BASIC system? That would also fix up GOTOs to correctly branch to the
    renumbered lines.

    Wasn't 'renumber' (or 'ren'?) just a command? Or is my memory playing tricks with me?

    It could be an external program (in basic itself), that when you run it
    loads some basic source (it is plain text) at renumbers it according to
    some criteria. It has got to find all jumps and fill a table.

    --
    Cheers, Carlos.

    --- SoupGate-Win32 v1.05
    * Origin: fsxNet Usenet Gateway (21:1/5)
  • From Philip Herlihy@21:1/5 to All on Thu Mar 6 12:17:19 2025
    XPost: alt.comp.os.windows-11, comp.editors, comp.text.pdf

    In article <m2s253FjseU2@mid.individual.net>, peter@silmaril.ie says...
    This is called "imposition". Printing a book means arranging the pages >(usually) 16 (possibly 32) per side of a very large sheet (and therefore
    16 the other side) making a "signature", laid out so that when folded
    and folded and folded etc and trimmed, page 1 has page 2 on the back of
    it; then repeat for the next 32 (or 64) pages, and repeat, etc until all
    X00 pages are accounted for. Printed off a reel of paper (confusingly
    called a web) nowadays, and slit to sheets before folding. Then stacked >together, the spines abraded and glued (or sewn with thread for fancy
    books), then draw on the cover (printed separately on board), glue it,
    and give it a final trim.




    I don't know what the international availability of this is, but this is
    a BBC program detailing how hardback books are manufactured, with a good discussion of "imposition".

    https://www.bbc.co.uk/iplayer/episode/m0027f48/inside-the-factory- series-9-5-hardback-books

    (rejoin link with no whitespace)

    --
    --
    Phil, London

    --- SoupGate-Win32 v1.05
    * Origin: fsxNet Usenet Gateway (21:1/5)
  • From Philip Herlihy@21:1/5 to All on Thu Mar 6 12:22:50 2025
    XPost: alt.comp.os.windows-11, comp.editors, comp.text.pdf

    In article <vqbhsv$2sjkp$1@dont-email.me>, daniel47@eternal-
    september.org says...
    Hmm!! Probably O.T. but the 'Dartmouth' you mention .... was that
    Dartmouth, Victoria, Australia or was that somewhere in U.K.?? Or
    elsewhere??



    Dartmouth College, Hanover, New Hampshire.

    https://calltolead.dartmouth.edu/stories/celebrating-birth-basic-and-
    beyond

    --
    --
    Phil, London

    --- SoupGate-Win32 v1.05
    * Origin: fsxNet Usenet Gateway (21:1/5)
  • From Carlos E.R.@21:1/5 to Philip Herlihy on Thu Mar 6 13:33:05 2025
    XPost: alt.comp.os.windows-11, comp.editors, comp.text.pdf

    On 2025-03-06 13:17, Philip Herlihy wrote:
    In article <m2s253FjseU2@mid.individual.net>, peter@silmaril.ie says...
    This is called "imposition". Printing a book means arranging the pages
    (usually) 16 (possibly 32) per side of a very large sheet (and therefore
    16 the other side) making a "signature", laid out so that when folded
    and folded and folded etc and trimmed, page 1 has page 2 on the back of
    it; then repeat for the next 32 (or 64) pages, and repeat, etc until all
    X00 pages are accounted for. Printed off a reel of paper (confusingly
    called a web) nowadays, and slit to sheets before folding. Then stacked
    together, the spines abraded and glued (or sewn with thread for fancy
    books), then draw on the cover (printed separately on board), glue it,
    and give it a final trim.




    I don't know what the international availability of this is, but this is
    a BBC program detailing how hardback books are manufactured, with a good discussion of "imposition".

    https://www.bbc.co.uk/iplayer/episode/m0027f48/inside-the-factory- series-9-5-hardback-books

    (rejoin link with no whitespace)

    In Spain, I get:

    Sorry, BBC iPlayer isn’t available in your region.

    It looks like you’re outside of the UK. BBC iPlayer is only available in
    the UK. If you are using a proxy or VPN, please turn off any of these
    services and try again.
    For help, please go to iPlayer help and FAQs

    --
    Cheers, Carlos.

    --- SoupGate-Win32 v1.05
    * Origin: fsxNet Usenet Gateway (21:1/5)
  • From Michael Logies@21:1/5 to All on Thu Mar 6 16:19:28 2025
    I use: https://www.pdf24.org/en

    --- SoupGate-Win32 v1.05
    * Origin: fsxNet Usenet Gateway (21:1/5)
  • From Peter Flynn@21:1/5 to Philip Herlihy on Fri Mar 7 22:43:32 2025
    XPost: alt.comp.os.windows-11, comp.editors, comp.text.pdf

    On 06/03/2025 12:17, Philip Herlihy wrote:

    I don't know what the international availability of this is, but this is
    a BBC program detailing how hardback books are manufactured, with a good discussion of "imposition".

    I watched that the other night. Very well done, although it omitted the slicing, folding, and stacking in signatures. I used to live near there.

    https://www.bbc.co.uk/iplayer/episode/m0027f48/inside-the-factory-series-9-5-hardback-books

    That's an iPlayer link, AFAIK not available outside the UK.
    The normal BBC programme description is here: https://www.bbc.co.uk/programmes/m0027f48

    Peter

    --- SoupGate-Win32 v1.05
    * Origin: fsxNet Usenet Gateway (21:1/5)
  • From Peter Flynn@21:1/5 to Lawrence D'Oliveiro on Fri Mar 7 22:36:57 2025
    XPost: alt.comp.os.windows-11, comp.editors, comp.text.pdf

    On 06/03/2025 02:18, Lawrence D'Oliveiro wrote:

    By the way, did you have some kind of line-renumbering utility in your
    BASIC system? That would also fix up GOTOs to correctly branch to the renumbered lines.

    AFAIR both HP BASIC and DEC-10 BASIC had line-renumbering (into 10s)
    that updated GOTO statements. I believe the DEC-10 BASIC was a direct descendant of Dartmouth BASIC, so presumably someone in DEC spotted the
    need and added the feature.

    Peter

    --- SoupGate-Win32 v1.05
    * Origin: fsxNet Usenet Gateway (21:1/5)
  • From Lawrence D'Oliveiro@21:1/5 to Peter Flynn on Sat Mar 8 01:04:54 2025
    XPost: alt.comp.os.windows-11, comp.editors, comp.text.pdf

    On Fri, 7 Mar 2025 22:36:57 +0000, Peter Flynn wrote:

    On 06/03/2025 02:18, Lawrence D'Oliveiro wrote:

    By the way, did you have some kind of line-renumbering utility in your
    BASIC system? That would also fix up GOTOs to correctly branch to the
    renumbered lines.

    AFAIR both HP BASIC and DEC-10 BASIC had line-renumbering (into 10s)
    that updated GOTO statements. I believe the DEC-10 BASIC was a direct descendant of Dartmouth BASIC, so presumably someone in DEC spotted the
    need and added the feature.

    I recall on RSTS/E, there was a separate utility program called RESEQ that
    did this for BASIC-PLUS programs. It was a long time ago, but it could
    have been that it did not operate on .BAS source code at all, but on the byte-compiled executable .BAC files.

    --- SoupGate-Win32 v1.05
    * Origin: fsxNet Usenet Gateway (21:1/5)