Copyright | Google, Inc. (c) 2013, Edward Kmett (c) 2011, Daan Leijen (c) 2000 |
---|---|
License | BSD-style (see the file LICENSE) |
Maintainer | ekmett@gmail.com |
Stability | experimental |
Portability | portable |
Safe Haskell | Safe-Inferred |
Language | Haskell98 |
Pretty print module based on Daan Leijen's implementation of Philip Wadler's "prettier printer"
"A prettier printer" Draft paper, April 1997, revised March 1998. http://homepages.inf.ed.ac.uk/wadler/papers/prettier/prettier.pdf
This is an implementation of the pretty printing combinators described by Philip Wadler (1997). In their bare essence, the combinators of Wadler are not expressive enough to describe some commonly occurring layouts. The PPrint library adds new primitives to describe these layouts and works well in practice.
The library is based on a single way to concatenate documents, which is associative and has both a left and right unit. This simple design leads to an efficient and short implementation. The simplicity is reflected in the predictable behaviour of the combinators which make them easy to use in practice.
A thorough description of the primitive combinators and their implementation can be found in Philip Wadler's paper (1997). Additions and the main differences with his original paper are:
- The nil document is called empty.
- The operator
</>
is used for soft line breaks. - There are three new primitives:
align
,fill
andfillBreak
. These are very useful in practice. - Lots of other useful combinators, like
fillSep
andlist
. - There are two renderers,
renderPretty
for pretty printing andrenderCompact
for compact output. The pretty printing algorithm also uses a ribbon-width now for even prettier output. - There are two display routines,
displayS
for strings anddisplayIO
for file based output. - There is a
Pretty
class. - The implementation uses optimised representations and strictness annotations.
- A type argument has been added and embedded
effects
can be seen in the SimpleDoc type.
- data Doc e
- putDoc :: Doc e -> IO ()
- hPutDoc :: Handle -> Doc e -> IO ()
- char :: Char -> Doc e
- text :: String -> Doc e
- nest :: Int -> Doc e -> Doc e
- line :: Doc e
- linebreak :: Doc e
- group :: Doc e -> Doc e
- softline :: Doc e
- softbreak :: Doc e
- hardline :: Doc e
- flatAlt :: Doc e -> Doc e -> Doc e
- flatten :: Doc e -> Doc e
- align :: Doc e -> Doc e
- hang :: Int -> Doc e -> Doc e
- indent :: Int -> Doc e -> Doc e
- encloseSep :: Foldable f => Doc e -> Doc e -> Doc e -> f (Doc e) -> Doc e
- list :: Foldable f => f (Doc e) -> Doc e
- tupled :: Foldable f => f (Doc e) -> Doc e
- semiBraces :: Foldable f => f (Doc e) -> Doc e
- (<+>) :: Doc e -> Doc e -> Doc e
- above :: Doc e -> Doc e -> Doc e
- (</>) :: Doc e -> Doc e -> Doc e
- aboveBreak :: Doc e -> Doc e -> Doc e
- (<//>) :: Doc e -> Doc e -> Doc e
- hsep :: Foldable f => f (Doc e) -> Doc e
- vsep :: Foldable f => f (Doc e) -> Doc e
- fillSep :: Foldable f => f (Doc e) -> Doc e
- sep :: Foldable f => f (Doc e) -> Doc e
- hcat :: Foldable f => f (Doc e) -> Doc e
- vcat :: Foldable f => f (Doc e) -> Doc e
- fillCat :: Foldable f => f (Doc e) -> Doc e
- cat :: Foldable f => f (Doc e) -> Doc e
- punctuate :: Traversable f => Doc e -> f (Doc e) -> f (Doc e)
- fill :: Int -> Doc e -> Doc e
- fillBreak :: Int -> Doc e -> Doc e
- enclose :: Doc e -> Doc e -> Doc e -> Doc e
- squotes :: Doc e -> Doc e
- dquotes :: Doc e -> Doc e
- parens :: Doc e -> Doc e
- angles :: Doc e -> Doc e
- braces :: Doc e -> Doc e
- brackets :: Doc e -> Doc e
- lparen :: Doc e
- rparen :: Doc e
- langle :: Doc e
- rangle :: Doc e
- lbrace :: Doc e
- rbrace :: Doc e
- lbracket :: Doc e
- rbracket :: Doc e
- squote :: Doc e
- dquote :: Doc e
- semi :: Doc e
- colon :: Doc e
- comma :: Doc e
- space :: Doc e
- dot :: Doc e
- backslash :: Doc e
- equals :: Doc e
- class Pretty a where
- pretty :: a -> Doc e
- prettyList :: [a] -> Doc e
- data SimpleDoc e
- renderPretty :: Float -> Int -> Doc e -> SimpleDoc e
- renderCompact :: Doc e -> SimpleDoc e
- renderSmart :: Int -> Doc e -> SimpleDoc e
- displayS :: SimpleDoc e -> ShowS
- displayIO :: Handle -> SimpleDoc e -> IO ()
- column :: (Int -> Doc e) -> Doc e
- nesting :: (Int -> Doc e) -> Doc e
- width :: Doc e -> (Int -> Doc e) -> Doc e
- columns :: (Int -> Doc e) -> Doc e
- ribbon :: (Int -> Doc e) -> Doc e
- empty :: Alternative f => forall a. f a
- (<>) :: Semigroup a => a -> a -> a
Documents
The abstract data type Doc
represents pretty documents.
Doc
is an instance of the Show
class. (show doc)
pretty
prints document doc
with a page width of 100 characters and a
ribbon width of 40 characters.
show (text "hello" `above` text "world")
Which would return the string "hello\nworld", i.e.
hello world
putDoc :: Doc e -> IO () Source
The action (putDoc doc)
pretty prints document doc
to the
standard output, with a page width of 100 characters and a ribbon
width of 40 characters.
main :: IO () main = do{ putDoc (text "hello" <+> text "world") }
Which would output
hello world
hPutDoc :: Handle -> Doc e -> IO () Source
(hPutDoc handle doc)
pretty prints document doc
to the file
handle handle
with a page width of 100 characters and a ribbon
width of 40 characters.
main = do{ handle <- openFile "MyFile" WriteMode ; hPutDoc handle (vcat (map text ["vertical","text"])) ; hClose handle }
Basic combinators
The document (char c)
contains the literal character c
. The
character shouldn't be a newline ('\n'
), the function line
should be used for line breaks.
text :: String -> Doc e Source
The document (text s)
contains the literal string s
. The
string shouldn't contain any newline ('\n'
) characters. If the
string contains newline characters, the function string
should be
used.
The line
document advances to the next line and indents to the
current nesting level. Document line
behaves like (text " ")
if the line break is undone by group
.
group :: Doc e -> Doc e Source
The group
combinator is used to specify alternative
layouts. The document (group x)
undoes all line breaks in
document x
. The resulting line is added to the current line if
that fits the page. Otherwise, the document x
is rendered without
any changes.
A linebreak that can not be flattened; it is guaranteed to be rendered as a newline.
flatAlt :: Doc e -> Doc e -> Doc e Source
flatAlt
creates a document that changes when flattened; normally
it is rendered as the first argument, but when flattened is rendered
as the second.
Alignment
align :: Doc e -> Doc e Source
The document (align x)
renders document x
with the nesting
level set to the current column. It is used for example to
implement hang
.
As an example, we will put a document right above another one, regardless of the current nesting level:
x $$ y = align (above x y)
test = text "hi" <+> (text "nice" $$ text "world")
which will be layed out as:
hi nice world
hang :: Int -> Doc e -> Doc e Source
The hang combinator implements hanging indentation. The document
(hang i x)
renders document x
with a nesting level set to the
current column plus i
. The following example uses hanging
indentation for some text:
test = hang 4 (fillSep (map text (words "the hang combinator indents these words !")))
Which lays out on a page with a width of 20 characters as:
the hang combinator indents these words !
The hang
combinator is implemented as:
hang i x = align (nest i x)
indent :: Int -> Doc e -> Doc e Source
The document (indent i x)
indents document x
with i
spaces.
test = indent 4 (fillSep (map text (words "the indent combinator indents these words !")))
Which lays out with a page width of 20 as:
the indent combinator indents these words !
encloseSep :: Foldable f => Doc e -> Doc e -> Doc e -> f (Doc e) -> Doc e Source
The document (encloseSep l r sep xs)
concatenates the documents
xs
separated by sep
and encloses the resulting document by l
and r
. The documents are rendered horizontally if that fits the
page. Otherwise they are aligned vertically. All separators are put
in front of the elements. For example, the combinator list
can be
defined with encloseSep
:
list xs = encloseSep lbracket rbracket comma xs test = text "list" <+> (list (map int [10,200,3000]))
Which is layed out with a page width of 20 as:
list [10, 200, 3000]
But when the page width is 15, it is layed out as:
list [ 10 , 200 , 3000 ]
list :: Foldable f => f (Doc e) -> Doc e Source
The document (list xs)
comma separates the documents xs
and
encloses them in square brackets. The documents are rendered
horizontally if that fits the page. Otherwise they are aligned
vertically. All comma separators are put in front of the elements.
tupled :: Foldable f => f (Doc e) -> Doc e Source
The document (tupled xs)
comma separates the documents xs
and
encloses them in parenthesis. The documents are rendered
horizontally if that fits the page. Otherwise they are aligned
vertically. All comma separators are put in front of the elements.
semiBraces :: Foldable f => f (Doc e) -> Doc e Source
The document (semiBraces xs)
separates the documents xs
with
semi colons and encloses them in braces. The documents are rendered
horizontally if that fits the page. Otherwise they are aligned
vertically. All semi colons are put in front of the elements.
Operators
above :: Doc e -> Doc e -> Doc e infixr 5 Source
The document above x y
concatenates document x
and y
with a
line
in between. (infixr 5)
(</>) :: Doc e -> Doc e -> Doc e infixr 5 Source
The document (x </> y)
concatenates document x
and y
with a
softline
in between. This effectively puts x
and y
either
next to each other (with a space
in between) or underneath each
other. (infixr 5)
aboveBreak :: Doc e -> Doc e -> Doc e infixr 5 Source
The document aboveBreak x y
concatenates document x
and y
with
a linebreak
in between. (infixr 5)
(<//>) :: Doc e -> Doc e -> Doc e infixr 5 Source
The document (x <//> y)
concatenates document x
and y
with
a softbreak
in between. This effectively puts x
and y
either
right next to each other or underneath each other. (infixr 5)
List combinators
hsep :: Foldable f => f (Doc e) -> Doc e Source
The document (hsep xs)
concatenates all documents xs
horizontally with (<+>)
.
vsep :: Foldable f => f (Doc e) -> Doc e Source
The document (vsep xs)
concatenates all documents xs
vertically with above
. If a group
undoes the line breaks
inserted by vsep
, all documents are separated with a space.
someText = map text (words ("text to lay out")) test = text "some" <+> vsep someText
This is layed out as:
some text to lay out
The align
combinator can be used to align the documents under
their first element
test = text "some" <+> align (vsep someText)
Which is printed as:
some text to lay out
fillSep :: Foldable f => f (Doc e) -> Doc e Source
The document (fillSep xs)
concatenates documents xs
horizontally with (<+>)
as long as its fits the page, then
inserts a line
and continues doing that for all documents in
xs
.
fillSep xs = foldr (</>) empty xs
sep :: Foldable f => f (Doc e) -> Doc e Source
The document (sep xs)
concatenates all documents xs
either
horizontally with (<+>)
, if it fits the page, or vertically with
above
.
sep xs = group (vsep xs)
hcat :: Foldable f => f (Doc e) -> Doc e Source
The document (hcat xs)
concatenates all documents xs
horizontally with (<>)
.
vcat :: Foldable f => f (Doc e) -> Doc e Source
The document (vcat xs)
concatenates all documents xs
vertically with aboveBreak
. If a group
undoes the line breaks
inserted by vcat
, all documents are directly concatenated.
fillCat :: Foldable f => f (Doc e) -> Doc e Source
The document (fillCat xs)
concatenates documents xs
horizontally with (<>)
as long as its fits the page, then inserts
a linebreak
and continues doing that for all documents in xs
.
fillCat xs = foldr (<//>) empty xs
cat :: Foldable f => f (Doc e) -> Doc e Source
The document (cat xs)
concatenates all documents xs
either
horizontally with (<>)
, if it fits the page, or vertically with
aboveBreak
.
cat xs = group (vcat xs)
punctuate :: Traversable f => Doc e -> f (Doc e) -> f (Doc e) Source
(punctuate p xs)
concatenates all documents in xs
with
document p
except for the last document.
someText = map text ["words","in","a","tuple"] test = parens (align (cat (punctuate comma someText)))
This is layed out on a page width of 20 as:
(words,in,a,tuple)
But when the page width is 15, it is layed out as:
(words, in, a, tuple)
(If you want put the commas in front of their elements instead of
at the end, you should use tupled
or, in general, encloseSep
.)
Fillers
fill :: Int -> Doc e -> Doc e Source
The document (fill i x)
renders document x
. It then appends
space
s until the width is equal to i
. If the width of x
is
already larger, nothing is appended. This combinator is quite
useful in practice to output a list of bindings. The following
example demonstrates this.
types = [("empty","Doc e") ,("nest","Int -> Doc e -> Doc e") ,("linebreak","Doc e")] ptype (name,tp) = fill 6 (text name) <+> text "::" <+> text tp test = text "let" <+> align (vcat (map ptype types))
Which is layed out as:
let empty :: Doc e nest :: Int -> Doc e -> Doc e linebreak :: Doc e
fillBreak :: Int -> Doc e -> Doc e Source
The document (fillBreak i x)
first renders document x
. It
then appends space
s until the width is equal to i
. If the
width of x
is already larger than i
, the nesting level is
increased by i
and a line
is appended. When we redefine ptype
in the previous example to use fillBreak
, we get a useful
variation of the previous output:
ptype (name,tp) = fillBreak 6 (text name) <+> text "::" <+> text tp
The output will now be:
let empty :: Doc e nest :: Int -> Doc e -> Doc e linebreak :: Doc e
Bracketing combinators
enclose :: Doc e -> Doc e -> Doc e -> Doc e Source
The document (enclose l r x)
encloses document x
between
documents l
and r
using (<>)
.
enclose l r x = l <> x <> r
brackets :: Doc e -> Doc e Source
Document (brackets x)
encloses document x
in square brackets,
"[" and "]".
Character documents
Pretty class
The member prettyList
is only used to define the instance Pretty
a => Pretty [a]
. In normal circumstances only the pretty
function
is used.
Pretty Bool | |
Pretty Char | |
Pretty Double | |
Pretty Float | |
Pretty Int | |
Pretty Int8 | |
Pretty Int16 | |
Pretty Int32 | |
Pretty Int64 | |
Pretty Integer | |
Pretty Word | |
Pretty Word8 | |
Pretty Word16 | |
Pretty Word32 | |
Pretty Word64 | |
Pretty () | |
Pretty ByteString | |
Pretty ByteString | |
Pretty Natural | |
Pretty Text | |
Pretty Text | |
Pretty a => Pretty [a] | |
Pretty a => Pretty (Maybe a) | |
Pretty a => Pretty (Seq a) | |
Pretty a => Pretty (NonEmpty a) | |
Pretty (Doc a) | |
(Pretty a, Pretty b) => Pretty (a, b) | |
(Pretty a, Pretty b, Pretty c) => Pretty (a, b, c) |
Rendering
The data type SimpleDoc
represents rendered documents and is
used by the display functions.
The Int
in SText
contains the length of the string. The Int
in SLine
contains the indentation for that line. The library
provides two default display functions displayS
and
displayIO
. You can provide your own display function by writing a
function from a SimpleDoc
to your own output format.
renderPretty :: Float -> Int -> Doc e -> SimpleDoc e Source
This is the default pretty printer which is used by show
,
putDoc
and hPutDoc
. (renderPretty ribbonfrac width x)
renders
document x
with a page width of width
and a ribbon width of
(ribbonfrac * width)
characters. The ribbon width is the maximal
amount of non-indentation characters on a line. The parameter
ribbonfrac
should be between 0.0
and 1.0
. If it is lower or
higher, the ribbon width will be 0 or width
respectively.
renderCompact :: Doc e -> SimpleDoc e Source
(renderCompact x)
renders document x
without adding any
indentation. Since no 'pretty' printing is involved, this
renderer is very fast. The resulting output contains fewer
characters than a pretty printed version and can be used for output
that is read by other programs.
renderSmart :: Int -> Doc e -> SimpleDoc e Source
A slightly smarter rendering algorithm with more lookahead. It provides
provide earlier breaking on deeply nested structures.
For example, consider this python-ish pseudocode:
fun(fun(fun(fun(fun([abcdefg, abcdefg])))))
If we put a softbreak (+ nesting 2) after each open parenthesis, and align
the elements of the list to match the opening brackets, this will render with
renderPretty
and a page width of 20c as:
fun(fun(fun(fun(fun([
| abcdef,
| abcdef,
]
))))) |
Where the 20c. boundary has been marked with |. Because renderPretty
only
uses one-line lookahead, it sees that the first line fits, and is stuck
putting the second and third lines after the 20c mark. In contrast,
renderSmart
will continue to check the potential document up to the end of
the indentation level. Thus, it will format the document as:
fun( | fun( | fun( | fun( | fun([ | abcdef, abcdef, ] | ))))) |
Which fits within the 20c. mark.
In addition, renderSmart
uses this lookahead to minimize the number of
lines printed, leading to more compact and visually appealing output.
Consider this example using the same syntax as above:
aaaaaaaaaaa([abc, def, ghi])
When rendered with renderPretty
and a page width of 20c, we get:
aaaaaaaaaaa([ abc
, def
, ghi ])
Whereas when rendered with renderSmart
and a page width of 20c, we get:
aaaaaaaaaaa(
[abc, def, ghi])
displayIO :: Handle -> SimpleDoc e -> IO () Source
(displayIO handle simpleDoc)
writes simpleDoc
to the file
handle handle
. This function is used for example by hPutDoc
:
hPutDoc handle doc = displayIO handle (renderPretty 0.4 100 doc)
Undocumented
Re-exported standard functions
empty :: Alternative f => forall a. f a
The identity of <|>