Markdown 2.7.0
Total Page:16
File Type:pdf, Size:1020Kb
TUGboat, Volume 40 (2019), No. 1 25 Markdown 2.7.0: Towards lightweight 2 New features markup in TEX Between versions 2.5.3 and 2.7.0 of the Markdown Vít Novotný package, there was one important patch version, 2.5.4, and two important minor versions, 2.6.0 and Abstract 2.7.0. Version 2.5.4 added support for the TEX op- Markdown is a lightweight markup language that tion -output-directory, version 2.6.0 introduced makes it easy to write structurally simple documents. the Lua command-line interface (CLI) and added sup- Existing tools for rendering markdown documents port for the doc LATEX package [4], and version 2.7.0 to PDF treat TEX as a black box. In contrast, the introduced the user manual and content slicing. Markdown package provides support for styling and In this section, I will show the new features. Al- typesetting markdown documents in TEX, extending though the package also supports plain TEX and Con- A a TEXie’s toolbox rather than forcing her to replace TEXt, all examples are in LTEX for ease of exposition. TEX with a more limited tool. Since its release in 2016, the package has re- 2.1 Setting the output directory ceived several important updates improving the func- TEX provides the -output-directory option, which tionality and user experience. In this article, I will changes the working directory for the TEX document. reintroduce the package, and describe its new func- This allows the user to redirect auxiliary files to a tionality and documentation. separate location. However, any external programs executed using the \write18 mechanism run in the original working directory. Since the Markdown pack- 1 Introduction age executes a Lua interpreter, which expects to find auxiliary files produced by the package in the current The primary strength of T X lies perhaps in its E working directory, this presents a problem. programming and typesetting capabilities, not its To solve the problem, version 2.5.4 of the Mark- syntax. Outside mathematics, non-programmable down package introduced the new outputDir option, markup languages such as Markdown [2] provide which informs the Lua interpreter where it should a gentler learning curve, improved readability, and look for the auxiliary files. Create a text document effortless single-source publishing for an author. named document.tex with the following content: Existing tools for rendering markdown docu- ments to PDF, such as Pandoc [1, 3] and MultiMark- \documentclass{article} down, have several important disadvantages, which \usepackage[outputDir=/dev/shm]{markdown} I discussed in my previous article [5]. These dis- \begin{document} \begin{markdown} advantages include black-boxing TEX, inconsistent A First Level Header support for T X commands in markdown documents, E ==================== increased complexity of maintenance, and the lack of A Second Level Header support for online TEX services such as Overleaf. The --------------------- Markdown TEX package [5] overcomes all of these. Now is the time for all good men to come to the In my previous article, I introduced version 2.5.3 aid of their country. This is just a paragraph. of the Markdown package, which was plagued by sev- \end{markdown} eral shortcomings: The package would not function \end{document} correctly when the -output-directory TEX option Execute the following command to produce a doc- was specified, since the package interacts with an ex- ument with a single section, subsection, and para- ternal Lua interpreter that is unaware of TEX options. graph, redirecting auxiliary files to /dev/shm (the The package was also wasteful with system resources, RAM disk available on recent Linux kernels): clogging up the file system with converted markdown documents unless the user cleaned them up manually. pdflatex -output-directory /dev/shm \ The documentation of the package was complete, but -shell-escape document.tex provided little help to the non-technical user. In this article, I introduce version 2.7.0 of the 2.2 The Lua command-line interface Markdown package, which tackles the above prob- The Markdown package hands markdown documents lems and introduces several new features, such as to a Lua parser. The parser converts them to TEX content slicing and the Lua CLI. In Section 2, I will and hands them back to the package for typesetting show the new features. In Section 3, I will describe (see Figure 1). This procedure has the advantage of the new documentation of the package. being fully automated. However, it also has several Markdown 2.7.0: Towards lightweight markup in TEX 26 TUGboat, Volume 40 (2019), No. 1 is a separate Lua program that can be executed User TEX Lua from the shell. Create a text document named example.md with the following content: \jobname.tex Some of these words *are emphasized*. \jobname.markdown.in Use two asterisks for **strong emphasis**. \jobname.markdown.out Next, execute the kpsewhich markdown-cli.lua command to find the location of the Lua CLI, such as \input\jobname.markdown.out /usr/local/texlive/2019/texmf-dist/scripts/ markdown/markdown-cli.lua on GNU/Linux with T X Live 2019. Execute hlocation of the Lua \jobname.pdf E texlua CLI i -- example.md example.tex to convert the example.md markdown document to TEX. Finally, create a text document named with Figure 1: A sequence diagram of the Markdown document.tex package typesetting a markdown document using the the following content: TEX interface. \documentclass{article} \usepackage{markdown} \begin{document} User T X Lua E \input example \end{document} hdocumenti.md Execute the pdflatex document.tex command to hdocumenti.tex produce a document with one formatted paragraph. \jobname.tex 2.3 Documenting LATEX packages \input hdocumenti The doc LATEX package makes it possible to docu- ment a LATEX document by writing a second LATEX document in the comments. This approach to docu- \jobname.pdf mentation is referred to as literate programming and is popular with LATEX package authors, as it keeps the documentation close to the documented code. Figure 2: A sequence diagram of the Markdown To encourage contributions and readability, doc- package typesetting a markdown document using the umentation can benefit from the use of a lightweight Lua command-line interface. markup language such as Markdown. To allow this use case, version 2.6.0 of the Markdown package important disadvantages: The converted TEX doc- introduced the stripPercentSigns option, which uments are cached on the file system, taking up an informs the Lua interpreter that it should strip per- increasing amount of space. Unless the TEX engine cent signs from the beginnings of lines in a mark- includes a Lua interpreter, the package also requires down document. Create a text document named shell access, which opens the door for a malicious document.dtx with the following content: actor to access the system. Last, but not least, the % \iffalse complexity of the procedure also impedes debugging. \documentclass{ltxdoc} A solution to the above problems is to decouple \usepackage[stripPercentSigns]{markdown} the conversion from the typesetting. First, the user \begin{document} converts markdown documents to TEX. Then, she \DocInput{document.dtx} typesets the TEX documents using the \input TEX \end{document} command (see Figure 2). Before the first step, the % \fi % \begin{markdown} user can remove any previously cached TEX docu- ments. Before the second step, she can transform the % * Candy. % * Gum. T X documents according to her need. During the E % * Booze. second step, she does not need to provide shell access % \end{markdown} to TEX. Since the individual steps are separated, the source of an error is immediately obvious. Execute the following command to produce a docu- To enable this workflow, version 2.6.0 of the ment with an unordered list: Markdown package introduced the Lua CLI, which pdflatex -shell-escape document.dtx Vít Novotný TUGboat, Volume 40 (2019), No. 1 27 2.4 Content slicing adheres too closely to the documented code. Some A Despite its simplicity (or perhaps because of it), LTEX packages provide a user manual that is writ- Markdown has become a popular choice for writing ten independently of the documented code. While all kinds of documents ranging from notes and lec- this improves readability, it also sacrifices literate ture slides to books that span hundreds of pages. programming and its ease of maintenance. When typesetting these documents, it is often useful Before version 2.5.6, the Markdown package only to typeset only a small part of them: Lecture slides provided technical documentation produced by liter- spanning an entire term can be chopped into lectures. ate programming. Since version 2.5.6, the package Bits and pieces from a ragtag of notes can be put to- also provided a user manual aimed at the end user gether into a single coherent document. A behemoth rather than a developer. This manual was, however, of a book that takes thirty minutes to compile may still produced using literate programming, leading take only one, when a single chapter is requested. to poor text structure. Since version 2.7.0, the user To make markdown documents more easily sty- manual is combined from three texts describing user lable, there exist syntax extensions for assigning interfaces, package options, and markdown tokens, HTML attributes to markdown elements. To enable respectively. This leads to readable documentation typesetting only a part of a markdown document, without sacrificing literate programming. version 2.7.0 of the Markdown package provides 4 Conclusion the headerAttributes and slice options. The headerAttributes option enables the Pandoc syn- TEX is a fine tool for typesetting many kinds of docu- ments. It may, however, not be the best language for tax for HTML attributes and the slice option spec- ifies which part of a document should be typeset.