Recomputed the word distributions, adding the average positions and deviation, and using only good words (so that the blocks would be more uniform):. cat bio-j-jsa-gut.wds \ | sed \ -e 's/[ql]j/H/g' \ -e 's/[ql]g/P/g' \ -e 's/ij/k/g' \ -e 's/ix/e/g' \ -e 's/is/r/g' \ -e 's/iiu/n/g' \ -e 's/cy/a/g' \ -e 's/ci/a/g' \ -e 's/in/m/g' \ -e 's/ir/w/g' \ -e 's/cs/z/g' \ -e 's/cg/8/g' \ | enum-words-in-blocks -vWPB=100 \ | sort +1 -2 +0 -1n \ | make-word-location-map -vCTWD=1 -vPERCENT=1 -vNBLOCKS=47 \ > .baz Here are the table lines for the most popular words (at least 20 occurrences) sorted by length (L) and total frequency: TOTAL AVG DEV L WORD ABSOLUTE FREQUENCY BY BLOCK RELATIVE FREQUENCY BY BLOCK ----- ----- ----- - ---------------- ----------------------------------------------- ----------------------------------------------- 127 24.7 12.2 2 oe 15.66...2.11.12.3134926354.623697511211.4..2361 00.00...0.00.00.0000100000.000010000000.0..0000 40 21.4 12.9 2 or 21.121..4........2124411.2...22.2.......1.121.1 00.000..1........0001100.0...00.0.......0.000.0 35 22.8 15.1 2 8a 222.2..1.2...11..1111....1...21.3.2112...11..3. 111.1..0.1...00..0000....0...10.1.1001...00..1. 20 21.7 11.5 2 am ...1.11..1.12......2..11.3..1.2....1...1......1 ...0.00..0.01......1..00.1..0.1....0...0......0 81 22.9 12.7 3 qoe .4.441.1111.3.341..553.13.214123442211...141.12 .0.000.0000.0.000..110.00.000000000000...000.00 73 26.3 13.1 3 8am ..1.3..1.3.322334.611.21....12..7224.12.2312224 ..0.0..0.0.000000.100.00....00..1000.00.0000000 51 22.8 16.1 3 8ar 125312111.....12.122.11.2.1...1.121.2.1..12721. 001100000.....00.000.00.0.0...0.000.0.0..00100. 50 21.0 13.3 3 8ae 1131211.1.11.13341142.....1..2..111221.21.3...1 0010000.0.00.01110010.....0..0..000000.00.1...0 31 23.7 14.0 3 zam 1....1..211412..11.1.1.1....1...1..1.1132...21. 0....0..100101..00.0.0.0....0...0..0.0011...10. 25 23.3 11.4 3 oHa ..1....221.1......21....213..1.2.11..21....1... ..0....110.0......10....101..0.1.00..10....0... 25 26.7 12.7 3 zoe 11.1.......11.....1.1.12..121.1.1.1...21311.... 00.0.......00.....0.0.01..010.0.0.0...10100.... 23 23.1 13.4 3 oea 21..1..1.1.........11.2211.1...22...1.....1.11. 10..0..0.0.........00.1100.0...11...0.....0.00. 79 24.4 13.6 4 qoHa 521311.21..3111.32...2.31352161.141.62323..222. 100000.00..0000.00...0.00010010.000.10000..000. 69 22.8 14.7 4 zcca 62.21.16111.2..22...224.2.2114311.1.12212121321 10.00.01000.0..00...001.0.0001000.0.00000000000 67 24.3 13.6 4 ccca .2.3.154.2213.1.1....21..111332.4123236121111.. .0.0.011.0000.0.0....00..000000.1000001000000.. 39 25.6 12.4 4 oHae ..2.2...21.....1.1111321.11214..21...211...1.22 ..0.0...00.....0.0000100.00001..00...000...0.00 37 24.7 11.8 4 oHam ..1.11.1...3.211..1.112131.123..1....11.2311... ..0.00.0...1.000..0.000010.001..0....00.0100... 35 15.4 12.6 4 oHar 21511.13221......211.1..2111..12.1.........1.1. 10100.01110......100.0..1000..01.0.........0.0. 25 25.9 15.0 4 Hc8a 1..3........1....4112....1..1..1.....1...112121 0..1........0....1001....0..0..0.....0...001010 21 21.8 14.1 4 oHca 1.1..4.....1.....2..11...2..1..21..1........21. 0.0..2.....0.....1..00...1..0..10..0........10. 204 25.0 14.4 5 zcc8a 26431595524383364713542211463334552525467789574 00000000000000000000000000000000000000000000000 172 22.7 13.8 5 ccc8a 46.1438463556719332362.413133235343254436532272 00.0000000000001000000.000000000000000000000000 113 25.8 12.3 5 qoHae 221411.311...31955...32.148.5432.45.665118312.. 000000.000...00100...00.001.0000.00.000001000.. 91 25.3 11.4 5 qoHam ......221..3874251.5.1111114521.1.74.362.433... ......000..0110000.0.0000000000.0.10.010.000... 83 25.2 16.4 5 oHc8a .15421.23432.211.417...111.....2211....32.31994 .01000.00000.000.001...000.....0000....00.00110 54 13.6 10.6 5 qoHan 8214212....11621426111.1..11..1..11.1........1. 1001000....00100101000.0..00..0..00.0........0. 48 23.2 13.4 5 qoHar 4..23.3....1........171..34211..121.222.11111.. 1..01.1....0........010..11000..000.000.00000.. 43 21.7 14.4 5 qoHca .22.1243...1.1.2..1.22.1.12..112111.3.....11112 .00.0011...0.0.0..0.00.0.00..000000.1.....00000 34 23.8 12.9 5 oHcca 1...2122.1.1.......1..1233.1...3.311.....2111.. 0...1011.0.0.......0..0111.0...1.100.....1000.. 31 25.0 12.0 5 cccca 1..11....1...23.11..1.11.1..31.1.131.21.11...1. 0..00....0...11.00..0.00.0..10.0.010.10.00...0. 23 19.8 9.9 5 zccca ....1.1.2111.11.1..11.21211.1......2......1.... ....0.0.1000.00.0..00.10100.0......1......0.... 21 25.4 9.9 5 qoHoe ..........1.3.1..11..1.12...1311....11.....11.. ..........0.1.0..00..0.01...0100....00.....00.. 198 24.0 14.2 6 qoHc8a 41946411238.556699393.1271123..5345284285377583 00000000000.000001000.0000000..0000000000000000 81 25.3 12.7 6 qoHcca .11.244.11111311222..2.355.4.113.1264.31224.211 .00.000.00000000000..0.011.0.000.0010.00000.000 56 22.4 13.7 6 oHcc8a 111.22.128.11....113111335....1211..1.1..11.62. 000.00.001.00....000000001....0000..0.0..00.10. 52 21.7 12.6 6 eccc8a 11.1..4111331311221.21.2.....3..1.134.242...... 00.0..1000110100000.00.0.....1..0.011.010...... 50 23.0 12.8 6 cccHca 31...1321.1.12.11.11.212.25.2.22211.111..1221.. 10...0100.0.00.00.00.000.01.0.00000.000..0000.. 37 24.7 14.8 6 zccHca ..2.1.251.1.....1...211...222.2...1....2.113211 ..0.0.010.0.....0...000...000.0...0....0.001000 36 21.3 12.9 6 zccc8a 11.21.1..4.1.11.21..3.122...2..2..11..11.2..2.. 00.10.0..1.0.00.10..1.011...1..1..00..00.1..1.. 21 24.7 13.5 6 ezcc8a ...1..1.1.12..111...1.....1..2...2..2...1..11.1 ...0..0.0.01..000...0.....0..1...1..1...0..00.0 183 21.4 13.3 7 qoHcc8a 449.44212683996225532.3863523222.1584335343353. 001.00000000100000000.0000000000.0000000000000. 35 24.5 11.4 7 ccccHca .1...11..121.1.1111..1..213611...1...1111..2.1. .0...00..010.0.0000..0..101200...0...0000..1.0. 31 23.6 12.2 7 zcccHca .1.21.1..112....1.1.......4521....2.11.21.1.... .0.10.0..001....0.0.......1110....1.00.10.0.... 23 20.4 14.8 7 oeccc8a ..1131..2.2....2...11.....1...1...2..11.1.....2 ..0010..1.1....1...00.....0...0...1..00.0.....1 Recomputing the coarse table: cat bio-j-jsa-gut.wds \ | sed \ -e 's/[ql]j/H/g' \ -e 's/[ql]g/P/g' \ -e 's/ij/k/g' \ -e 's/ix/e/g' \ -e 's/is/r/g' \ -e 's/iiu/n/g' \ -e 's/cy/a/g' \ -e 's/ci/a/g' \ -e 's/in/m/g' \ -e 's/ir/w/g' \ -e 's/cs/z/g' \ -e 's/cg/8/g' \ | enum-words-in-blocks -vWPB=666 \ | sort +1 -2 +0 -1n \ | make-word-location-map -vCTWD=3 -vPERCENT=1 -vNBLOCKS=7 \ > .bar Results posted in my Voynich page. For comparison, let's try English and Portuguese: cat engl.wds | tr '[A-Z]' '[a-z]' | head -4661 \ | enum-words-in-blocks -vWPB=100 \ | sort +1 -2 +0 -1n \ | make-word-location-map -vCTWD=1 -vPERCENT=1 -vNBLOCKS=47 \ > .baz TOTAL AVG DEV WORD ABSOLUTE FREQUENCY BY BLOCK RELATIVE FREQUENCY BY BLOCK ----- ----- ----- ---------------- ----------------------------------------------- ----------------------------------------------- 199 24.3 14.0 the 9.134354855516419225114342267572.41699419576541 0.000000000000000000000000000000.00000000000000 165 23.2 13.0 a 14335233324554364463275662314144313.55413634432 00000000000000000000000000000000000.00000000000 117 25.5 13.1 and 21122322313312.3323521..52255444113423313334332 00000000000000.0000000..00000000000000000000000 114 23.4 12.9 of 211242314.3123632464.23432.3141.1262.431842..31 000000000.0000000000.00000.0000.0000.000100..00 114 24.1 14.3 to 233324312221261.2325331321212321332331122246423 000000000000000.0000000000000000000000000000000 105 23.3 13.3 i 356.12.114113.34.312233117213512125221445132..1 001.00.000000.00.000000001000000000000000000..0 80 24.7 13.6 in 32.1.21.12.35132212221321.3221221..22221.324421 00.0.00.00.01000000000000.0000000..00000.000000 59 25.1 12.5 she ..21.214..11112...1311.14332.114222111...12.321 ..00.001..00000...0000.01000.001000000...00.000 58 24.4 15.0 was 1312.11131.221..112213.11.12.3..1.12....125432. 0000.00000.000..000000.00.00.0..0.00....001100. 54 27.0 13.5 her ..2311.2....1112.1.31...313...1..3652..112.1213 ..0100.0....0000.0.10...101...0..1110..000.0001 51 26.1 13.2 that ..122.11111...22..2.3121..21.51.1..11.423.23.2. ..000.00000...00..0.1000..00.10.0..00.101.01.0. 50 22.8 11.3 you ..2..21411..22..11..3.3.144...119.12...3.....1. ..0..00100..00..00..1.1.011...002.00...1.....0. 45 21.7 17.0 had 142361...1.11..1.1.1.1.......32.....11124.1213. 010110...0.00..0.0.0.0.......10.....00001.0001. 43 23.7 15.9 as 11223121.2.1...1..1.1...111.1.1.312.21.1.3.1122 00001000.0.0...0..0.0...000.0.0.100.00.0.1.0000 42 22.3 13.3 my 222..1...121.123.1..11.1111222.2..2....412....1 000..0...000.001.0..00.0000000.0..0....100....0 38 18.6 12.6 he .222311.1.12...1.23..4.1....4.1.2....112...1... .000100.0.00...0.01..1.0....1.0.0....000...0... 38 20.6 13.6 at 11.121111142...112...11..21.2.1111.....112111.. 00.000000010...000...00..00.0.0000.....000000.. 38 24.1 14.4 with 1.11.1.111.2311..32....111.12.1....1..1.2.2.33. 0.00.0.000.0100..10....000.00.0....0..0.0.0.11. 34 19.9 10.9 it 11.....1323.1...131.1115..1..2..21....1...11... 00.....0111.0...010.0001..0..1..10....0...00... 30 24.5 15.4 for .2.212.11.1..1..1........111..1.11.121.2221...1 .1.101.00.0..0..0........000..0.00.010.1110...0 30 26.1 9.4 me .1......1.....11.11.22.2113211..1112..1.12..... .0......0.....00.00.11.1001100..0001..0.01..... 27 23.2 10.3 is ......12..2..1.2..2.22..1212.......12121....... ......01..1..0.1..1.11..0101.......01010....... 27 29.8 12.5 mrs ..1..1.........121.11..2..2.11.1.1..1.1..21231. ..0..0.........010.00..1..1.00.0.0..0.0..10110. 26 14.3 11.7 his .3.324....1..1...22111.......11..2........1.... .1.111....0..0...11000.......00..1........0.... 24 30.9 14.2 we 11.......1.1....2..........2.13..1...2....13.5. 00.......0.0....1..........1.01..0...1....01.2. 23 25.0 12.9 on ...1..2.1.1..1...121.1...2.....11.11.2..1.2..1. ...0..1.0.0..0...010.0...1.....00.00.1..0.1..0. 22 23.2 13.2 be ..2....2....22.1.....112..1.1..1......12.111... ..1....1....11.0.....001..0.0..0......01.000... 21 25.3 13.1 up .1....1.1.21.1........1..1121....111...3..1...1 .0....0.0.10.0........0..0010....000...1..0...0 20 21.9 14.9 an 1.11..1.1.111.11..1........1...2.1.1...1..1..2. 0.00..0.0.000.00..0........0...1.0.0...0..0..1. 20 22.2 12.1 john .1..11..11...111.1.......1111.2..1..112........ .0..00..00...000.0.......0000.1..0..001........ 20 23.3 13.5 but ....2.22..........1..122..1.....111..1....1.11. ....1.11..........0..011..0.....000..0....0.00. cat port.wds | tr '[A-Z]' '[a-z]' \ | egrep -v '^x$' \ | head -4661 \ | enum-words-in-blocks -vWPB=100 \ | sort +1 -2 +0 -1n \ | make-word-location-map -vCTWD=1 -vPERCENT=1 -vNBLOCKS=47 \ > .boh