Last edited on 1998-07-15 04:19:23 by stolfi

Reeds-compressed Voynichese

The samples below were derived from the Biological section of the Voynich manuscript by a compression process suggested by Jim Reeds: repeatedly look for the most common digraph, and replace it by a new symbol. (You can see here some scripts that implement this transformation.)

The original alphabet was basic EVA (lower-case letters only). For new symbols, we used the uppercase letters A,B,C, in order. The comments in each file list the substitutions performed at each stage.

Word-based compression

These texts were compressed on a word by word basis, i.e. the substituted digraphs did neither include nor span word breaks.

Paragraph-based compresion

These texts were obtained by compressing a whole paragraph at a time. Word breaks were replaced by hyphens, and handled like any other letter. Paragraph breaks, however, were still treated as hard barriers.