expl3 from day one. A basic expl3 function name such as \foo:nn shows how many unmodified braced arguments it takes: so-called n-type arguments. We can then create variants, which can lead to expansion only once (o-type), to the value of a variable (V-type) or to the value retrieved by constructing the name of a variable and then finding the value (v-type). We can do the same with single-token (N-type) arguments, which are often themselves functions and can be given as a constructed name (c-type).
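As a sketch of how these variants work in practice (the \mypkg_... names here are hypothetical, invented purely for illustration):

```latex
\ExplSyntaxOn
% A hypothetical two-argument function taking braced (n-type) arguments
\cs_new:Npn \mypkg_pair:nn #1#2 { [ #1 | #2 ] }
% Ask expl3 to generate a variant whose first argument is the
% value of a variable (V-type)
\cs_generate_variant:Nn \mypkg_pair:nn { Vn }
\tl_new:N \l_mypkg_demo_tl
\tl_set:Nn \l_mypkg_demo_tl { stored }
% Equivalent to \mypkg_pair:nn { stored } { direct }
\mypkg_pair:Vn \l_mypkg_demo_tl { direct }
\ExplSyntaxOff
```

The point is that only the base :nn function carries the implementation; every variant is generated mechanically from it.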
How about exhaustively expanding all of the tokens in an argument? To date, that has been handled by x-type expansion. This uses \edef behind the scenes, so the experienced TeX programmer will see that it cannot itself work in an expansion context. Using \edef also has the side effect that # tokens need to be doubled in the input.
A little while ago now, the LaTeX Team arranged for a ‘new’ primitive \expanded to be added to the major TeX engines. This works almost in the same way as \edef except that it is itself expandable and it does not require # tokens to be doubled. Using this primitive, we added e-type expansion to expl3, and have used it for creating variants of expandable functions.
That left us with two almost-identical variants and a tricky task of explaining which to use, as there are places we want e-type expansion even if the underlying function isn’t expandable (where that # doubling business is an issue). In particular, with a bit of care for a few edge cases, it turns out that everything that is set up for x-type expansion can be converted to e-type. That includes things like \cs_set_nopar:Npx, which when you look closely we should have called \cs_set_nopar:Npe from the word go: there’s no # doubling as this is just \edef renamed.
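To illustrate that renaming (using a hypothetical \mypkg_... function; the two lines behave identically in current expl3):

```latex
\ExplSyntaxOn
\tl_set:Nn \l_tmpa_tl { hello }
% Old name: the x here really just means 'defined with \edef'
\cs_set_nopar:Npx \mypkg_tmp:n #1 { \tl_use:N \l_tmpa_tl ~ (#1) }
% New name: exactly the same behaviour, now consistently labelled e.
% Note the single # in both cases: no doubling is needed here
\cs_set_nopar:Npe \mypkg_tmp:n #1 { \tl_use:N \l_tmpa_tl ~ (#1) }
\ExplSyntaxOff
```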
So we’ve now made the decision to pivot toward e-type expansion across the board. We’ll be retaining the (now deprecated) x-type variants that are already in expl3, but the documentation and all new variants will only be e-type. Once the new release is out, package authors are encouraged to move all of their x-type usage to e-type. The timeframe will of course depend on the stability approach of individual package authors: for siunitx, I’m simply going to step the minimum required expl3 release and be done with it, but others may be more cautious.
What is important here is that almost all users should see minimal impact: provided the installed expl3 core files match, there should be no obvious change for end users. What we gain, though, at the code level is a lot more consistency and clarity of design choice.
That leaves one additional variant: f-type, which is almost like e-type but stops at the first non-expandable token. It exists largely because we needed something expandable before we had e-type, but it still has a few edge use cases. So we won’t be dropping it, but in almost all cases code using f-type expansion can move to e-type. Again, I’ve done that in siunitx and will do a sweep over the expl3 core soon. So we can expect to see a move to almost no use of f-type other than in some specialist low-level places.
A key driver in the tidy-up here is that we would like to provide, as far as possible, pre-defined variants for the core expl3 functions. That means having some way of avoiding a combinatorial explosion: the more variants we need, the more this is an issue. So we are aiming to get the ‘core’ set to n, V, v and e, plus N and c, with o and f where they are required. The latest expl3 release fleshes out more pre-defined variants for this set, and we expect that to grow a little more as we try to standardise more functions around this core set.
The keen expl3 programmer is likely wondering what they need to do in detail. Working on the basis that you are already requiring the expl3 release with these changes (2023-10-10), then:

- x-type variants provided by expl3 have now got a matching e-type, so you can simply change the naming, unless there are # tokens in an argument, in which case you also need to undouble them
- for internal code, replace your own x-type variants with e-type ones

As an example of the point on # tokens, you might currently have something like
\use:x
{
\cs_new:Npn \exp_not:N \mypkg_foo:w ##1 \c_colon_str ##2 \c_underscore_str
{
% Code using ##1 and ##2
}
}
which would need to change to
\use:e
{
\cs_new:Npn \exp_not:N \mypkg_foo:w #1 \c_colon_str #2 \c_underscore_str
{
% Code using #1 and #2
}
}
Of course, if there are no doubled # tokens to worry about, it’s really just a search-and-replace. So we can all now get on and use the ‘Jag’!
siunitx has supported uncertainty values in numbers. Uncertainties are a key piece of information about a lot of scientific values, and so it’s important to have a convenient way to present them. The most common uncertainty we see is one that is symmetrical: a value plus-or-minus some number, for example 1.23 ± 0.04. This could be a standard deviation from repeated measurement, or a tolerance, or derived some other way. Luckily for me, the source of such a value doesn’t matter: siunitx just needs to be able to read the input, store it and print the output. For both reading and printing, siunitx has two ways of handling these symmetrical uncertainties.
In version 3 of siunitx, I took that existing support and added a long-requested new feature: rounding to an uncertainty. That means that if you have something like 1.2345 ± 0.0367 and ask to round to one place, the uncertainty is first rounded (to 0.04), then the main value is rounded to the same precision (to 1.23).
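In input terms, that behaviour is controlled by the rounding options; a minimal sketch (option names as I read them from the siunitx v3 documentation):

```latex
\usepackage{siunitx}
\sisetup{round-mode = uncertainty, round-precision = 1}
% 1.2345 ± 0.0367, rounded to one place in the uncertainty:
% this should come out as 1.23 ± 0.04
\num{1.2345 \pm 0.0367}
```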
Building on that, v3.1 added the idea of multiple uncertainties. These come up in some areas (astronomy is one, particle physics another) where there are clear sources of distinct uncertainty elements. Supporting multiple uncertainties also means supporting descriptions for them: if you are dividing up uncertainty, you likely want to say why. So in v3.1, you can say 1.23(4)(5) or 1.23 ± 0.04 ± 0.05, set up the descriptors, and have something like 1.23 ± 0.04 (sys) ± 0.05 (stat) get printed. I’ve not had any feedback yet on this new feature: fingers crossed that means it all works 100%!
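A sketch of the input side (the uncertainty-mode and uncertainty-descriptors option names are my reading of the v3.1 documentation, so treat them as assumptions):

```latex
% Two uncertainties with descriptors; this is intended to print
% something like 1.23 ± 0.04 (sys) ± 0.05 (stat)
\sisetup{
  uncertainty-mode        = separate ,
  uncertainty-descriptors = { sys , stat } ,
}
\num{1.23(4)(5)}
```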
Now, for v3.3, I’ve looked at another long-standing request: asymmetric uncertainties. For this release, I’ve kept this area simple, as it’s one I know less about. There’s just a ‘compact’ input form, and one (compact) output form. So we can input 1.23(4:5) and get, in TeX terms, $1.23^{+0.04}_{-0.05}$ typeset. Asymmetric and symmetric uncertainties can be intermixed, and you can have multiple asymmetric ones. I’m hoping this feature gets picked up by users, and that I get some idea of what to do next. I suspect there might be alternative output formats requested, and I wonder whether a ‘long’ input form 1.23 + 0.04 - 0.05 will be asked for: I’ve not done that yet as it’s more tricky if the user misses one part out!
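Putting the new input form together (the mixed example below is my own extrapolation from the statement that asymmetric and symmetric uncertainties can be intermixed):

```latex
% Compact asymmetric input: +0.04 above, -0.05 below
\num{1.23(4:5)}
% An asymmetric uncertainty mixed with a symmetric one
\num{1.23(4:5)(6)}
% The same input form works inside a quantity
\qty{1.23(4:5)}{\electronvolt}
```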
Hopefully, with the introduction of asymmetric uncertainty support, siunitx covers just about all types of uncertainty in scientific data: it is aiming to be a comprehensive (SI) units package, after all!
café, it is made up of four codepoints. So we could in XeTeX/LuaTeX use a simple mapping to grab one character at a time and do stuff with it. However, that’s not always the case. Take for example Spın̈al Tap. The dotless-i is a single codepoint, but there is not a codepoint for an umlauted-n. Instead, that is represented by two codepoints: a normal n and a combining umlaut. As a user, it’s clear that we’d want to get a single ‘character’ here. So there’s clearly more work to do.
Luckily, this is not just a TeX problem and the Unicode Consortium have thought about it for us. They provide a data file and rules that describe how to divide input into graphemes: ‘user perceived characters’. So ‘all’ that is needed is to examine the input using these rules, and to divide it up so that ‘characters’ stay together.
For pdfTeX, there’s an additional wrinkle: it uses bytes, not codepoints, and so if we use a naïve TeX mapping, we would divide up any codepoint outside the ASCII range into separate bytes: not good. Luckily, the nature of codepoints is predictable: all that is needed is to examine the first byte and collect the right number of further bytes to re-combine into a valid codepoint.
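That first-byte rule can be sketched in expl3 terms (this \mypkg_... helper is purely illustrative, not part of the kernel): byte values below "80 stand alone, leading bytes below "E0 start two-byte sequences, those below "F0 three-byte sequences, and the rest four-byte ones.

```latex
\ExplSyntaxOn
% Expand to the total number of bytes in the UTF-8 sequence whose
% first byte has the integer value #1. Continuation bytes ("80-"BF),
% which never start a sequence, are not handled in this sketch.
\cs_new:Npn \mypkg_utfviii_length:n #1
  {
    \int_compare:nNnTF {#1} < { "80 }
      { 1 }
      {
        \int_compare:nNnTF {#1} < { "E0 }
          { 2 }
          { \int_compare:nNnTF {#1} < { "F0 } { 3 } { 4 } }
      }
  }
\ExplSyntaxOff
```

For example, é is the byte pair C3 A9, and C3 is below "E0, so two bytes are collected and recombined into the single codepoint U+00E9.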
This work isn’t something the average end user wants to do. Luckily, they don’t have to, as the LaTeX team have looked at this and created a suitable set of expl3 functions to do it: \text_map_function:nN and \text_map_inline:nn. So for example we can do
\ExplSyntaxOn
\text_map_inline:nn { Spın̈al ~ Tap } { (#1) }
\ExplSyntaxOff
and get
(S)(p)(ı)(n̈)(a)(l)( )(T)(a)(p)
in any TeX engine (assuming we are set up to print the characters, of course).
Taking a more ‘serious’ example (and one that is going to use LuaTeX for font reasons), we might want to map over Bangla text. It’s easy to do that with the expl3 function \tl_map_inline:nn, but it gives very odd results. In contrast, \text_map_inline:nn divides up the characters correctly.
\documentclass{article}
\usepackage{fontspec}
\newfontface\harfbengali
{NotoSansBengali-VariableFont_wdth,wght.ttf}[Renderer=HarfBuzz,Script=Bengali]
\begin{document}
\harfbengali
\ExplSyntaxOn
ন্দ্রকিন্দ্র
\par
\text_map_inline:nn{ন্দ্রকিন্দ্র}{(#1)}
\par
\tl_map_inline:nn{ন্দ্রকিন্দ্র}{(#1)}
\ExplSyntaxOff
\end{document}
which gives the correct grapheme division only in the \text_map_inline:nn case. (You’ll need Noto Sans Bengali available to make this work locally.)
So, as you can see, mapping over ‘real’ text is easy with expl3: you just need to know that the tools are there.
There are three ways of marking up inline math in LaTeX source:

- $...$
- \(...\)
- \begin{math} ... \end{math}

The last version is clearly far too verbose for routine use, but the first and second approaches have a much less clear-cut division.
Plain TeX uses the $...$ construct exclusively, and that means many experienced (La)TeX users simply use this without any further consideration. There are good arguments in favour of the syntax, most obviously that this switches directly into math mode (it uses the underlying TeX idea of category codes with no macro expansion required). On the other hand, it lacks any possibility of matching begin and end points.
LaTeX’s \(...\) syntax was introduced by Lamport early in the development of the format. Using separate begin and end marks means that it does allow error detection in the editor, and it is also linked visually to LaTeX’s display math \[...\] approach. (More on that below.)
So which one to use? Experience suggests that whilst Lamport made many good decisions in the design of LaTeX’s input syntax, \(...\) wasn’t the best of them. The number of times that pair-matching is helpful simply doesn’t compete with the extra complexity of the input. At the same time, there’s no difference in the results between the two syntaxes, so there isn’t a downside to using $...$. So I (and the majority of the current LaTeX team) favour using $...$.
I think it’s important to contrast this with (unnumbered) display math mode. There, the LaTeX \[...\] syntax is the officially-supported approach, and the plain TeX $$...$$ is not. For display math, there are significant differences in what can be done using \[...\] compared to directly switching to TeX’s display math mode using $$...$$, and so the situation is clear: use \[...\].
siunitx v3.1. One area where I’ve now been able to commit improvements is the handling of complex values. In v2, you could give complex values in the normal argument to \num or \SI. I removed that for v3, and of course that was not entirely popular. Instead, I introduced dedicated commands, \complexnum and \complexqty. Part of the reason for that was that it makes the implementation of \num and \qty/\SI easier. But the other was that I wanted to address polar form, and that really didn’t look viable if it was mixed in with the normal numerical argument type.
I’ve now committed a change that introduces support for polar form in siunitx. So what happens now is that if you give a value such as \complexnum{10:30}, it’s treated as a magnitude and an angle. The latter has a setting to determine whether it’s regarded as being in degrees or radians. The package can then typeset the result in a similar form, using the \angle symbol between the two parts. You can also set up conversion between the classical (Cartesian) and polar forms of the value. So hopefully this shows why I wanted to separate out complex numbers: they need special handling, and now they get it.
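A sketch of the new behaviour (the option names complex-angle-unit and complex-mode are how I read the development documentation, so treat them as assumptions):

```latex
% Polar input: magnitude 10, angle 30
\complexnum{10:30}
% Interpret the angle as degrees rather than radians
\sisetup{complex-angle-unit = degrees}
% Convert polar input to Cartesian (a + bi) form on output
\sisetup{complex-mode = cartesian}
\complexnum{10:30}
```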
siunitx on the v3.0.x branch. These have addressed quite a few minor bugs: I expected to have to do a bit of work, since the shift from v2 was quite major. Things are now settling down: the open issues I’ve had recently are mainly on the border of feature requests, and there don’t seem to be additional changes I’ve introduced by accident. With the TeX Live freeze coming up, now looks like an excellent time to turn my thoughts to siunitx v3.1. The plan there is to deal with two areas. Of course, that doesn’t rule out further bugs being fixed in the v3.0 branch: I will continue to fix things that come up there. Depending on how the big issues go, I might manage a v3.1 release in the summer (June-August).
I consulted with my favourite duck internet buddy, Paulo Cereda, and he pointed me to the rather flexible Hamilton theme. You’ll see I’m tweaking it a bit, so there will be minor changes over time, but I think it looks good: it balances not being totally plain against the fact that I have zero design ability! Paulo himself doesn’t have a blog, but he’s part of the excellent Island of TeX, most famous for arara. Paulo tells me they don’t use Jekyll, but rather Zola, but that didn’t stop him helping me :)
siunitx was out in May, after the TeX Live 2021 DVD. That means it’s been picked up primarily by more active users: people who install TeX between the ‘fixed’ DVD releases (or who use MiKTeX). It also didn’t initially appear on Overleaf, as they take a while to test TeX Live images before making them public.
I’ve been making maintenance releases between May and now, and have reached v3.0.36, picking off small (or less small) issues I’d missed initially. At the same time, Overleaf now have a TeX Live 2021 image (currently featuring siunitx v3.0.23). So I now have an increasing number of ‘normal’ users: people who don’t want to deal with testing, and just want their documents to work.
What I notice is that increased usage hasn’t raised any truly major issues. Yes, there have been corrections (see the ChangeLog for the detail), but they were mainly at the level of predictable issues: places that I’d not explored quite enough. I hope Overleaf will consider an in-place update to somewhere around the latest release: whilst the issues have been minor in the grand scheme, it would be good to get a reasonably bug-reduced version out there (I’m not claiming bug-free)!
So I’m seeing the release as in the end quite a big success: I’ve addressed the issues I knew about, got better testing, have cleaner interfaces and am already offering new features. My mind is therefore turning to v3.1: I have a list of issues to consider that I’d like to take for that release, plus I could pick off some others. I might of course not tackle all of these: I’m thinking of starting over the Christmas period and looking to release in March/April 2022. By then of course we might be at v3.0.50, so it will also help to ‘reset’ the patch level!
So it was quite interesting to be talking yesterday at a chemistry conference (the ACS Fall 2021 Meeting) about siunitx. I’d been invited by Stuart Chalk to a session on units and data reuse: much more like metrology/computer science than my usual day-to-day wet chemistry!
It was good to see that many of the things I do in siunitx fit into wider efforts by people who do day-to-day work on units. The idea of logical mark-up for unit input, the ability to decompose units into parts and the realities of less-than-ideal input from users were all there. Hopefully, siunitx will help with the work being done by groups such as DRUM (Digital Representation of Units of Measure) to make information more computer-readable. I’ll also be looking at QUDT for inspiration about the real technical detail of the myriad of units in real use.

I also managed to get in a few comments about some LaTeX work that’s important for data reuse more widely: tagged PDFs and tex4ht.
So it was a pretty productive use of an evening!
siunitx out, I am as expected getting quite a few questions about moving from v2. In the main, this is quite easy, as there is a decent amount of compatibility code. Here, I’ll pick out a few cases where you might want some adjustments.

One thing that people sometimes need is to work with the latest version but allow their input to work with the older version: that’s particularly true if you work with people using Overleaf, as it will be some time before they update to v3. You can of course just stick to the v2 interfaces, but if you’d prefer to have v3 if possible, then you will need to define \qty and \unit (and maybe others) conditionally. I’d recommend doing that using
\usepackage{siunitx}
\ifdefined\qty\else
\ifdefined\NewCommandCopy
\NewCommandCopy\qty\SI
\else
\NewDocumentCommand\qty{O{}mm}{\SI[#1]{#2}{#3}}
\fi
\fi
\ifdefined\unit\else
\ifdefined\NewCommandCopy
\NewCommandCopy\unit\si
\else
\NewDocumentCommand\unit{O{}m}{\si[#1]{#2}}
\fi
\fi
That then leaves options, but almost always these should be set in the preamble, so are a ‘one shot’. You can of course add to my tests above to know which version is in use, and set selectively.
For people who’ve been using products or complex numbers in \SI in v2, one could use a similar approach to the above to ‘keep’ the functionality by setting it equivalent to the new \qtyproduct or \complexqty commands: of course, if you want both then you’ve got to make bigger changes. For example, to retain the ability to use products in \SI, you’d use
\usepackage{siunitx}
\ifdefined\qtyproduct\else
\ifdefined\DeclareCommandCopy
\DeclareCommandCopy\SI\qtyproduct
\else
\DeclareDocumentCommand\SI{O{}mm}{\qtyproduct[#1]{#2}{#3}}
\fi
\fi
at the cost that the code is a bit slower than \qty for input without products. Complex values would be handled the same way, just changing the command you use as a ‘replacement’.
In v2, \litre and \liter produced different output: that was not the best interface decision. So in v3 they are the same, but that means of course that you might see a change. Luckily, you can set the output you want and get the same in both v2 and v3.

\DeclareSIUnit\litre{l}