Last edited on 1998-07-16 06:48:02 by stolfi
Latin
- Latin I
Cæsar's De Bello Gallico
[ full ]
[ page ]
The overall color is similar to that of
modern Romance languages; but the word endings in
-u- --- the characteristic -um,
-us, -unt, etc. --- seem to be
somewhat more predictable.
Note the dark spot in the
ll of Gallia, and the bright
spots on non-Latin "words" --- e.g. the -x in
the name Orgetorix, the y in
Pyrenaeos, the word space after Pisone's
abbreviated first name, and the numerals on the last line.
Note also the bright spot on the que ending
(which usually means and, but is written
after the conjunct and attached to it).
l = 1 r = 0
[ colorized page ]
[ bits per tuple ]
l = 2 r = 0
[ colorized page ]
[ bits per tuple ]
l = 3 r = 0
[ colorized page ]
[ bits per tuple ]
l = 1 r = 1
[ colorized page ]
[ bits per tuple ]
- Latin II
Rule of the Benedictine monks
[ full ]
[ page ]
This text looks pretty much like
Cæsar's. Again we see bright spots on word-initial
b. But note the highly predictable
non, very proper for a strictly regulated
order...
l = 1 r = 0
[ colorized page ]
[ bits per tuple ]
l = 2 r = 0
[ colorized page ]
[ bits per tuple ]
l = 3 r = 0
[ colorized page ]
[ bits per tuple ]
l = 1 r = 1
[ colorized page ]
[ bits per tuple ]
- Latin III
A theological text by William of Ockam
[ full ]
[ page ]
More of the same. Actually, this text
is more repetitive than the other two; note for
example de dark fide- stem (English
faith-, IIRC). Indeed, its entropy is
significantly lower (h3 = 2.56,
against the 2.6-2.8 of other latin texts).
l = 1 r = 0
[ colorized page ]
[ bits per tuple ]
l = 2 r = 0
[ colorized page ]
[ bits per tuple ]
l = 3 r = 0
[ colorized page ]
[ bits per tuple ]
l = 1 r = 1
[ colorized page ]
[ bits per tuple ]