(III.) Theoretical Background: CUPS, PPDs, PostScript & GhostScript

This chapter aims to give a bit of theoretical background to printing in general, and to CUPS especially.

Basics About Printing

Printing is one of the more complicated chapters in IT technology.

Earlier on in history, every developer of a program that was capable of spitting out printable output had to write his own printer drivers too. That was quite complicated, because different programs have different file formats. Even programs with the same usage, for example, word processors, often do not understand each others formats. So, there was no common interface to all printers, hence the programmers often supported only a few selected models.

A new device appearing on the market required the program authors to write a new driver if they wanted their program to support it. Also for manufacturers, it was impossible to make sure their device was supported by any program known to the world (although, there were far fewer than today.)

Having to support ten application programs and a dozen printers, meant a system administrator had to deal with 120 drivers. So the development of unified interfaces between programs and printers became an urgent need.

The appearance of “Page Description Languages”, describing the graphical representation of ink and toner on sheets of paper (or other output devices, like monitors, photo typesetters, etc.) in a common way was a move that found a big gap.

One such development was PostScript® by Adobe. It meant that an application programmer could concentrate on making his program give out a PostScript® language description of his printable page, while printing device developers could focus on making their devices PostScript® literate.

Of course, there came, over time, the development of other description methods. The most important competitors to PostScript® were PCL (“Print Control Language”, from Hewlett-Packard®), “ESC/P” (from Epson) and GDI (“Graphical Device Interface” from Microsoft®).

The appearance of these page description languages eased life, and facilitated further development for everybody. Yet the fact that there still remained different, incompatible, and competing page description languages keeps life for users, administrators, developers and manufacturers difficult enough.

PostScript® in memory - Bitmaps on Paper

PostScript® is most heavily used in professional printing environments such as PrePress and printing service industries. In the domains of UNIX® and Linux®, PostScript® is the pre-dominant standard as a PDL. Here, nearly every program generates a PostScript® representation of it's pages once you push the ‘Print’ button. Let us look at a simple example of (hand-made) PostScript® code. The following listing describes two simple drawings:

Example 1.0. PostScript® Code, handcrafted

Listing: A Snippet of PostScript "Code"

1    %!PS            % First 2 characters need to be '%!' (magic numbers).
2    % two boxes     % '%' introduces comments. The virtual PS-pen is asked
3    100 100 moveto  % to move to coordinate (100,100), then draw a
4    0 50 rlineto    % relative line 0 units to the right and 50 to the top,
5    50 0 rlineto    % go on with 50 to the right (0 to the top
6    0 -50 rlineto   % now 50 units straight down,
7    closepath       % close the "path",
8    .7 setgray      % switch to 70% gray value for the color to use and
9    fill            % fill the box with this color..
10   %               % First box is finished; next figure
11   160 100 moveto  % is constructed in an analogous way.,
12   0 60 rlineto    % but this time not just with horizontal
13   45 10 rlineto   % and vertical lines, but also with lopsided ones..
14   0 -40 rlineto   % (yes, 20% in PostScript stands for a more
15   closepath       % dark value than 70%).
16   .2 setgray      %
17   fill            % The closing command "showpage" tells
18   showpage        % the printer to eject the page...

This tells the imaginary PostScript® ‘pen’ to draw a path of a certain shape, and then fill it with different shades of gray. The first part translates into more comprehensive English as ‘Go to coordinate (100,100), draw a line with length 50 upward; then one from there to the right, then down again, and finally close this part.Now take a paint of 70% gray, and use it to fill the drawn shape.’

Example 1.1. PostScript® Code, less readable

Listing: A Snippet of PostScript "Code", as written by many PostScript-generating programs...

  %!PS           
  100 100 moveto 0 50 rlineto 50 0 rlineto  0 -50 rlineto  closepath       
  .7 setgray fill 160 100 moveto 0 60 rlineto 45 10 rlineto  0 -40 rlineto   
  closepath .2 setgray fill            
  showpage

This is the same PostScript code, but written in a much less readable way. This is how often PostScript drivers or other PostSript-generating programs would write it. It still is completely "legal" code....

Beneath is the picture which would be drawn by "Ghostview" on screen or printed by a printer on paper after its PostScript interpreter had rendered it into a raster image:

Example 1.2. Rendered PostScript®

Picture: A Snippet of a PostScript "Image"

Example 4.0 example rendered as an image.

Of course, PostScript® can be much more complicated than this simplistic example. It is a fully fledged programming language with many different operators and functions. You may even write PostScript® programs to compute the value of Pi, format a harddisk or write to a file. The main value and strength of PostScript® however lays in the field to describe the layout of graphical objects on a page: it also can scale, mirror, translate, transform, rotate and distort everything you can imagine on a piece of paper -- such as letters in different font representations, figures, shapes, shades, colors, lines, dots, raster...

A PostScript® file is a representation of one or more to-be-printed pages in a relatively abstract way. Ideally, it is meant to describe the pages in an device-independent way. PostScript® is not directly ‘visible'; it only lives on the hard disks and in RAM memory as a coded representation of future printouts.

Raster Images on Paper Sheets

What you see on a piece of paper is nearly always a ‘raster image’. Even if your brain suggests to you that your eyes see a line: take a good magnifying glass and you will discover lots of small dots... (One example to the contrary are sheets that have been drawn by ‘pen plotters’). And that is the only thing what the ‘marking engines’ of todays printers can put on paper: simple dots of different colors, size, resolution to make up a complete ‘page image’ composed of different bitmap patterns.

Different printers need the raster image prepared in different ways. Thinking about an inkjet device: depending on its resolution, the number of used inks (the very good ones need different 7 inks, while a cheaper one might have use 3), the number of available jets (some print heads have more than 100!) spitting out ink simultaneously, the ‘dithering algorithm’ used, and many other things, the final raster format and transfer order to the marking engine is heavily dependent on the exact model used.

Back in the early life of the ‘Line Printer Daemon’, printers were machines that hammered rows of ASCII text mechanically onto long media, folded as a zig-zag paper snake, drawn from cardboard boxes beneath the table... What a difference from today!

Now that you know how a PostScript® language file (which describes the page layout in a largely device independent way) is traveling to become transformed into a Raster Image, you might ask: “Well, there are different kinds of raster output devices: first they differ in their resolution; then there are the different paper sizes; it goes on with many finishing options (duplex prints, pamphlets, punched and stapled output with different sheets of colored paper being drawn from different trays, etc.). How does this fit into our model of device-independent PostScript®?”

The answer comes with so called PostScript® Printer Description (PPD files. A PPD describes all the device dependent features which can be utilized by a certain printer model. It also contains the coded commands that must be used to call certain features of the device. But PPDs are no closed book, they are simple ASCII text files.

PPDs were “invented” by Adobe to make it easy for manufacturers to implement their own features into PostScript® printers, and at the same time retain a standard way of doing so. PPDs are well documented and described by Adobe. Their specification is a de-facto open standard.

Why Specially Crafted PPDs are Now Useful Even For Non-PostScript® Printers

Now you know how PostScript®-Printers can use PPDs. But what about non-PostScript® printers? CUPS has done a very good trick: by using the same format and data structure as the PostScript® Printer Descriptions (PPDs) in the PostScript® world, it can describe the available print job options for non-PostScript® printers just the same. For its own special purposes CUPS just added a few special options (namely the line which defines the filter to be used for further processing of the PostScript® file).

So, the developers could use the same software engine to parse the Printer Description Files for available options for all sorts of printers. Of course the CUPS developers could not rely on the non-PostScript® hardware manufacturers to suddenly develop PPDs. They had to do the difficult start themselves and write them from scratch. More than 1000 of these are available through the commercial version of CUPS, called ESP PrintPro.

Meanwhile there are a lot of CUPS-specific PPDs available. Even now those are in most cases not originating from the printer manufacturers, but from Free software developers. The CUPS folks proofed it, and others followed suit: where Linux® and UNIX® printing one or two years ago still was a kludge, it is now able to support a big range of printers, including 7-color inkjets capable of pushing them to Photo Quality output.

Different Ways to get PPDs for non-PostScript® Printers

You can get PPDs to be used with CUPS and non-PostScript® printers from different areas in the Web:

first, there is the repository at www.linuxprinting.org, which lets you generate a ‘CUPS-O-Matic’-PPD online for any printer that had been supported by traditional Ghostscript printing already. This helps you to switch over to CUPS with little effort, if you wish so. If your printer was doing well with the traditional way of Ghostscript printing, take CUPS-O-Matic to plug your driver into th e CUPS system and you'll have the best of both worlds.
second, there are CUPS-PPDs for the more than 120 printer models, which are driven by the new universal stp driver. stp (stood originally for Stylus Photo) is now developed by the gimp-print project; it was started by Mike Sweet, the leading CUPS developer and is now available through gimp-print.sourceforge.net. This driver prints real Photo quality on many modern inkjets and can be configured to make 120 CUPS-PPDs along its own compilation. HP® Laser- and DeskJet, Epson® Stylus and Photo Color models as well as some Canon® and Lexmark® are covered.
third, there is the commercial extension to CUPS from the CUPS developers themselves: it is called ESP PrintPro and comes with more than 2.300 printer drivers. There are even improved imagetoraster and pstoraster filters included.

CUPS makes it really easy for manufacturers to start supporting Linux® and UNIX® printing for their models at reasonably low cost. The modular framework of CUPS facilitates to plug in any filter (=driver) with minimal effort and to access and utilize the whole printing framework that CUPS is creating.

Read more about the CUPS features in the available CUPS documentation at http://www.cups.org/documentation.html and http://www.danka.de/printpro/faq.html. Also at http://www.linuxprinting.org/ is a universal repository for all issues related to Linux® and UNIX® printing.

(III.) Some Theoretical Background:
CUPS, PPDs, PostScript® and GhostScript

Basics About Printing

PostScript® in memory - Bitmaps on Paper

Raster Images on Paper Sheets

RIP: From PostScript® to Raster

Ghostscript as a Software RIP

‘Drivers’ and ‘Filters’ in General

Drivers and Filters and Backends in CUPS

Spoolers and Printing Daemons

Conclusion: How CUPS uses the power of PPDs

Device Dependent Print Options

Where to get the PPDs for PostScript® Printers

Why Specially Crafted PPDs are Now Useful Even For Non-PostScript® Printers

Different Ways to get PPDs for non-PostScript® Printers

(III.) Some Theoretical Background: CUPS, PPDs, PostScript® and GhostScript

Basics About Printing

PostScript® in memory - Bitmaps on Paper

Raster Images on Paper Sheets

RIP: From PostScript® to Raster

Ghostscript as a Software RIP

‘Drivers’ and ‘Filters’ in General

Drivers and Filters and Backends in CUPS

Spoolers and Printing Daemons

Conclusion: How CUPS uses the power of PPDs

Device Dependent Print Options

Where to get the PPDs for PostScript® Printers

Why Specially Crafted PPDs are Now Useful Even For Non-PostScript® Printers

Different Ways to get PPDs for non-PostScript® Printers

(III.) Some Theoretical Background:
CUPS, PPDs, PostScript® and GhostScript