Skip to main
Article
Style written on sketchbook page

Generating Code Documentation for Polyglot Projects

Code documentation is ideally written as close to the actual code as possible, but compiled into a comprehensive set of documentation that includes code from all languages in use. Here’s how we intend to do that.

At OddBird we believe that undocumented code is unfinished code. The act of documenting clarifies what we are building, and the resulting documentation guides consistency in how we do things in different parts of a project built by different team members at different times.

In a typical project, one of our documentation deliverables is a living style guide which serves as documentation of the visual elements used in the app as well as guidelines for and examples of usage. The style guide is built automatically whenever we build the code, and it is delivered as a static HTML website which can be served alongside the app. For example, the style guide for this website can be found here:

We are currently building our style guides using a tool called SassDoc which compiles the documentation based on special comments written inline in our stylesheets. For example, the triple-slash commented lines in the following Sass:

// Selection
// ---------
/// Selected text is highlighted in orange,
/// with any text-shadows removed.
/// @group typography
::selection {
  @include contrasted('select');
  text-shadow: none;
}

is rendered in the style guide like this:

syntax highlighted code block and description of selector

We are working on our own theme for Sassdoc, called Herman, which provides extra tools for rendering samples of things like colors, fonts, and icons. (Alas, while we are using it to produce documentation, it is not yet very well-documented itself.)

Generating documentation from inline comments like this is ideal for developer-oriented documentation (that is, documentation of how the system is built rather than how to use it). It keeps the documentation right next to the code that it documents, which both makes it more accessible and increases the chances that it will be kept up to date.

But SassDoc only knows how to read comments from Sass. And the projects we work on implement patterns using multiple languages working together: Sass for stylesheets, Nunjucks/Jinja2 for markup, JavaScript for interactivity, Python on the backend. How can we generate documentation for a pattern that involves multiple languages, without giving up on the goal of writing documentation inline?

The Python documentation tool Sphinx has a nice pattern for this: rather than only fetching documentation from one particular kind of code source file, it allows for writing free-form documentation organized in whatever way makes sense, but with the ability to use “autodoc” directives to pull in documentation from inline source comments wherever makes sense. For example, this directive would add documentation generated from the code and comments in the rstblog module:

.. automodule:: rstblog

But this approach still suffers from the single-language problem! Sphinx’s autodoc extension is focused on Python code. And while it is extensible, there is a challenge in creating good autodoc extensions for other languages: different languages use different syntaxes, so need to be parsed by a tool that understands the language. But often a high-quality parser of a particular language is not available in the Python ecosystem. So Sphinx autodoc extensions to pull in inline documentation from other languages are not consistently available or well-maintained.

In order to provide more flexibility, I propose tackling this challenge using a decoupled architecture: a central documentation formatter that parses source code using separate processes.

The central formatter would work similarly to Sassdoc or Sphinx: read a file that specifies the overall structure of the documentation and look for directives that ask to include automatic documentation from other source files.

A parsing utility would have the limited responsibility of reading a source file of one particular type. The central formatter would run the parser as a separate process and output a JSON representation of the code structure and comments for the central formatter to make use of. This way the parsing utility can be written in whatever language best supports parsing the source language.

As a proof of concept, in the near future we intend to add a feature to Herman to automatically include documentation of macros from Nunjucks templates. Stay tuned!

If you have thoughts about how to make documentation better, or if you’d like to hire us to help set up a living style guide or other tools, please get in touch.

Recent Articles

  1. Stacks of a variety of cardboard and food boxes, some leaning precariously.
    Article post type

    Setting up Sass pkg: URLs

    A quick guide to using the new Node.js package importer

    Enabling pkg: URLs is quick and straightforward, and simplifies using packages from the node_modules folder.

    see all Article posts
  2. Article post type

    Testing FastAPI Applications

    We explore the useful testing capabilities provided by FastAPI and pytest, and how to leverage them to produce a complete and reliable test suite for your application.

    see all Article posts
  3. A dark vintage accessory store lit by lamps and bare bulbs, with bags, jewelry, sunglasses and a bowler hat on the wooden shelves and carved table.
    Article post type

    Proxy is what’s in store

    You may not need anything more

    When adding interactivity to a web app, it can be tempting to reach for a framework to manage state. But sometimes, all you need is a Proxy.

    see all Article posts