3 A note on style

3.1 Spacing, Paragraphs

Sequences of spaces are normally translated into a single space. Newlines in the input document undergo a special treatment. A newline triggers a special scanning mode that reads all following spaces and newlines. If at least one additional newline character is read, then HEVEA executes the \par command. Otherwise, HEVEA outputs a single newline character. This process approximates the TeX process for introducing paragraph breaks and, as a result, empty lines produce paragraph breaks.
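As a small illustration of this rule, consider the following fragment. The sentences are placeholders, and the comments only restate the behaviour described above.

```latex
% A single newline: HEVEA outputs one newline character.
First line of the first paragraph,
second line of the same paragraph.
% An empty line follows: the additional newline makes HEVEA execute \par.

Second paragraph.
```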

Space after commands with no argument is skipped, as in LaTeX. However, this is not true in math mode, as explained in section 3.2.1.

The following two subsections describe the management of paragraphs and of spaces after command sequences in greater detail. They can be skipped on first reading.

3.1.1 Spurious Paragraphs

Paragraphs are rendered by means of p elements. HEVEA is a bit simplistic in breaking paragraphs, and spurious paragraphs may be present in the final html document. Normally, as HEVEA never outputs p elements whose content consists of spaces only, this should not happen very often. Unfortunately, some commands do not produce any output in LaTeX, while they do produce output in HEVEA: those commands are \label, \index, etc. HEVEA translates \label{name} into the anchor <a id="name"></a>. As a result, a \label command that stands alone between two empty lines introduces a spurious paragraph.
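A minimal fragment exhibiting the problem could look as follows. The label name and the two sentences are placeholders chosen for illustration, not text taken from an actual document.

```latex
Some text.

% A paragraph made of a sole \label command: HEVEA outputs a p element for it.
\label{name}

Some other text.
```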

A browser renders such a paragraph as a p element that contains nothing but the anchor; drawing borders around p elements makes it visible.

Most of the time, such extra paragraphs remain unnoticed. Of course, they can be suppressed by erasing one of the empty lines, so that the \label command joins a neighbouring paragraph.
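A sketch of such a correction follows, again with a placeholder label name and placeholder sentences. Here the empty line before \label has been erased.

```latex
Some text.
% No empty line above: \label now belongs to the first paragraph.
\label{name}

Some other text.
```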

A similar situation occurs when a sectioning command is followed by \label and then by a paragraph break.
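Such a situation can be sketched as follows, reusing the section title and label name of the other fragments in this subsection. The exact layout is an assumption.

```latex
\section*{A section}
% The label sits alone between the heading and an empty line.
\label{section:label}

First paragraph.
```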

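The output contains a spurious paragraph because closing the h2 element implies opening a new paragraph, which then holds nothing but the anchor. The remedy is to put the \label command either inside the argument of the sectioning command:

\section*{A\label{section:label} section}

First paragraph.

or at the very beginning of the first paragraph:

\section*{A section}

\label{section:label}First paragraph.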

In all cases, this amounts to avoiding a paragraph whose content consists of a sole \label command.

Spurious paragraphs are more easily seen by running hevea with the command-line option -dv, which instructs hevea to add borders to some of the elements it produces, including p elements.

3.1.2 Spaces after Commands

Space after commands with no argument is skipped. Consider the following example:
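A sketch of such an example follows. The \open macro is the one named in the text; its definition as an opening parenthesis and the companion \close macro are assumptions made for illustration.

```latex
% \open is named in the text; its body and the \close macro are hypothetical.
\newcommand{\open}{(}
\newcommand{\close}{)}
% The space typed after \open is skipped, so "(" is directly followed by "text".
\open text in parentheses\close.
```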

In the resulting output, the space after \open is absent.

More generally, HEVEA tries to emulate LaTeX behaviour in all situations, but discrepancies probably exist. Thus, users are invited to make explicit what they want. This is good practice anyway, because LaTeX is mysterious here. Consider the following example, where the \tryspace macro is first applied and then expanded by hand:
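The example might be written along the following lines. Only the name \tryspace comes from the text; the macro body, the \bfsymbol helper and the tabular layout are assumptions, chosen to be consistent with the remarks on “XXX” in this section.

```latex
% \tryspace is named in the text; its body and \bfsymbol are hypothetical.
\newcommand{\tryspace}[1]{#1 XXX}
\newcommand{\bfsymbol}{\textbf{symbol}}
\begin{tabular}{rcl}
% Macro applied: the space after #1 is part of the macro body.
Some space & : & \tryspace{\bfsymbol}\\
% Expanded by hand: the space follows a command name.
No space   & : & \bfsymbol XXX
\end{tabular}
```

In the first row the space after #1 belongs to the macro body and is kept; in the second row the space directly follows a command name and is skipped.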

Spacing is a bit chaotic here: the space after #1 in the macro body remains once the argument has been substituted for #1 by LaTeX (or HEVEA), while the same space typed directly after a command name is skipped.

Note that, if a space before “XXX” is wanted, then one should probably write:
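One common LaTeX idiom for this is to end the command name with an empty group. The sketch below assumes a hypothetical \bfsymbol macro, defined here for illustration only.

```latex
% \bfsymbol is a hypothetical macro; \providecommand avoids a clash
% if it is already defined elsewhere in the document.
\providecommand{\bfsymbol}{\textbf{symbol}}
% The empty group ends the command name, so the space that follows is kept.
\bfsymbol{} XXX
```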

Finally, whether the tabulation character is a space or not is random, so avoid tabs in your source document.

3.2 Math mode

HEVEA math mode is not very far from normal text mode, except that all letters are shown in italics and that space after macros is echoed.

However, typesetting math formulas in html raises two difficulties. First, formulas contain symbols, such as Greek letters; second, even simple formulas do not follow the basic typesetting model of html.

3.2.1 Spacing in math mode

By contrast with LaTeX, spaces from the input are significant in math mode. This feature allows users to instruct HEVEA on how to put space in their formulas. For instance, \alpha\rightarrow\beta is typeset without spaces between symbols, whereas \alpha \rightarrow \beta produces these spaces.

Note that LaTeX ignores spaces in math mode, so that users can freely adjust HEVEA output without changing anything in the LaTeX output.

3.2.2 Symbols

The treatment of symbols has evolved significantly since the first versions of HEVEA. Symbols are now output as Unicode character references, an option that complies with standards much better than the previous one of selecting a “symbol” font. This choice is now possible because more and more browsers display such references correctly. See Figure 1 for a few such symbols.

However, this means that ancient or purposely limited browsers (such as text-oriented browsers) cannot display maths as translated by HEVEA. For authors who insist on avoiding symbols that some browsers cannot show, HEVEA offers a degraded mode that outputs text in place of symbols. HEVEA operates in this mode when given the -textsymbols command-line option. Replacement text is in English. For instance, the “∈” symbol is replaced by “in”. This is far from satisfactory, but degraded mode may be appropriate for documents that contain few symbols.

3.2.3 Displays

Apart from containing symbols, formulas specify strong typesetting constraints: sub-elements must be combined following patterns that depart from normal text typesetting. For instance, the numerator and denominator of a fraction must be placed one above the other. HEVEA handles such constraints in display mode only.

The two main operating modes of HEVEA are text mode and display mode. Text mode is the mode for typesetting normal text. In this mode, text items are echoed one after the other and paragraph breaks are just blank lines, both in input and output. The so-called displayed-paragraph environments of LaTeX (such as center or quote) are rendered by html block-level elements (such as div or blockquote). Rendering is correct because both LaTeX displayed environments and html block-level elements start a new line. Conversely, since opening an html block-level element means starting a new line, any text that should appear inside a paragraph must be translated using only html text-level elements. HEVEA chooses to translate in-text formulas that way.

HEVEA display mode allows more control over text placement, since entering display mode means opening an html table element, and tables allow control over the relative position of their sub-elements. Displays come in two flavors: horizontal displays and vertical displays. A horizontal display is a one-row table, while a vertical display is a one-column table. These tables hold the display sub-elements, which are centered vertically in horizontal display mode and horizontally in vertical display mode.

Display mode is first opened by opening a displaymath environment (e.g. by $$ or \[). Then, sub-displays are opened by the LaTeX constructs that require them. For instance, a displayed fraction (\frac) opens a vertical display.

The distinction between text and display modes clearly appears while typesetting math formulas. An in-text formula such as $\int_1^2 xdx = \frac{3}{2}$ appears as ∫₁² xdx = 3/2, while the same formula looks better in display mode, where the fraction is typeset as a real fraction.
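In source form, the displayed variant of this formula can be written with the \[ shorthand for displaymath mentioned earlier. The fragment is a sketch and simply reuses the formula from the text.

```latex
% Same formula as the in-text version, now in a displaymath environment.
\[
  \int_1^2 xdx = \frac{3}{2}
\]
```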

As a consequence, HEVEA is more powerful in display mode, and formulas should be displayed as soon as they get a bit complicated. This rule also holds in LaTeX, but it is stricter in HEVEA, since html capabilities to typeset formulas inside text are quite poor. In particular, it is not possible to get in-text “real” fractions or in-text limit-like subscripts.

Users should remember that HEVEA is not TeX or LaTeX and that the author of HEVEA is neither D. E. Knuth nor L. Lamport. Thus, some formulas may be rendered poorly. For instance, two fractions whose numerators and denominators differ in height look strange.
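A made-up formula of the following kind puts a tall denominator next to a tall numerator. It is only an illustration of the situation described, and not necessarily the formula the author had in mind.

```latex
% Two adjacent fractions: the first has a tall denominator,
% the second a tall numerator.
\[
  \frac{1}{\sum_{i=1}^{n} x_i} \times \frac{\sum_{i=1}^{n} x_i}{1} = 1
\]
```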

The reason is that vertical displays in a horizontal display are html tables that always get centered in the vertical direction. Such a crude model cannot faithfully emulate any TeX box placement.

Users can get an idea of how HEVEA combines elements in display mode by giving the -dv command-line option, which instructs HEVEA to add borders to the table elements introduced by displays.

3.2.4 Arrays and display mode

By contrast with formulas, which HEVEA attempts to render with text-level elements only when they appear inside paragraphs, LaTeX arrays always translate to the block-level element table, thereby introducing undesired line breaks before and after in-text arrays. As a consequence, in-text arrays yield acceptable output only when they stand alone in a paragraph.

However, since in some sense, all html tables are displayed, the array and tabular environments implicitly open display mode, thus allowing a satisfactory typesetting of formulas in arrays. More precisely, array elements whose column format specification is l, c or r are typeset in display mode (see section B.10.2).
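As a sketch of this behaviour, the following hypothetical array uses l, c and r columns, so that each cell can hold a fraction or a sum with limits. The cell contents are arbitrary.

```latex
% Columns of type l, c and r: their cells are typeset in display mode.
\[
\begin{array}{lcr}
  \frac{1}{2} & \sum_{i=1}^{n} i & \frac{n(n+1)}{2}
\end{array}
\]
```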

3.3 Warnings

When HEVEA thinks it cannot translate a symbol or construct properly, it issues a warning. This draws the user's attention to a potential problem. However, rendering may be correct.

For instance, HEVEA gets nervous when \hspace is given a complicated length as argument.
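A (silly) example of this kind might read as follows. The \mylength register and the numeric factor are assumptions made for illustration; the point is only that the argument of \hspace is not a plain length.

```latex
% \mylength is a hypothetical length register, used for illustration only.
\newlength{\mylength}
\setlength{\mylength}{5pt}
% The argument of \hspace is a factor times a length register.
text\hspace{3.1415\mylength}text
```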

Note that all warnings can be suppressed with the -s (silent) option. When a warning reveals a real problem, it can often be cured by writing a specific macro. The next two sections introduce HEVEA macros, then section 4 describes how to proceed in greater detail.

3.4 Commands

Just like LaTeX, HEVEA can be seen as a macro language: macros are rewritten until no more expansion is possible. Then, either some characters (such as letters, digits…) are output or some internal operations (such as changing font attributes, or arranging text items in a certain manner) are performed.

This scheme favors easy extension of program capabilities by users. However, predicting program behaviour and correcting errors may prove difficult, since final output or errors may occur after several levels of macro expansion. As a consequence, users can tailor HEVEA to their needs, but it remains a subtle task. Nevertheless, happy LaTeX users should enjoy customizing HEVEA, since this is done by writing LaTeX code.

3.5 Style choices

LaTeX and html differ in many aspects. For instance, LaTeX allows fine control over text placement, whereas html does not. More symbols and font attributes are available in LaTeX than in html. Conversely, html has font attributes, such as color, which standard LaTeX lacks.

Therefore, there are many situations where HEVEA just cannot render the visual effect of LaTeX constructions. Here some choices have to be made. For instance, calligraphic letters (\mathcal) are rendered in red.

If you are not satisfied with HEVEA rendering of text style declarations, then you can choose your own by redefining the \cal macro with \renewcommand, the macro redefinition operator of LaTeX. The key point is that you need not worry about HEVEA internals: just redefine the old-LaTeX style text-style declarations (i.e. \it, \sc, etc.) and everything should be fine.
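A possible redefinition is sketched below. The replacement styles (\Huge and \em) are arbitrary choices made for illustration, and the fragment assumes a document class, such as article, that defines \sc and \cal.

```latex
% Illustrative choices only: small caps rendered as huge text,
% calligraphic letters rendered as emphasized text.
\renewcommand{\sc}{\Huge}
\renewcommand{\cal}{\em}
```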

(See sections 4 and 5 on how to make such changes while leaving your file processable by LaTeX, and section 10.2 for a more thorough description of customizing type styles).

Note that many LaTeX commands and environments are defined in the hevea.hva file that HEVEA loads before processing any input. These constructs are written in LaTeX source code; in the end they invoke HEVEA internal commands.

Other LaTeX constructs that require special processing, such as LaTeX key constructs or HEVEA internal commands (see section 8.3), are defined in HEVEA source code. However, the vast majority of these definitions can be overridden by a redefinition. This may prove useless, since there is little point in redefining core constructs such as \newcommand.

Output of the \tryspace example (section 3.1.2):

Some space : symbol XXX
No space   : symbolXXX

Figure 1: A few symbols

\in:        ∈      \notin:     ∉
\int:       ∫      \prod:      ∏
\preceq:    ≼      \prec:      ≺
\leq:       ≤      \geq:       ≥
\cup:       ∪      \cap:       ∩
\supset:    ⊃      \subset:    ⊂
\supseteq:  ⊇      \subseteq:  ⊆
