# Last edited on 2026-01-02 17:41:37 by stolfi

HISTORY OF THE SPS TRANSCRIPTION

  To make my SPS transcription ("starps-U.evt"), I started by creating
  "starps-H.eva" with the SPS part of the VMS, with only the lines from
  the Takeshi Takahashi transcription (";H>") from page f103r to page
  f116r line 30.

  Created a similar file "starps-Z.eva" with Rene Zandbergen's
  transcription (";Z>"). Checked and revised it thoroughly by reference
  to the Beinecke 2014 online scans as of 2025-06. (Thus it is no longer
  "Rene's"!)

  The format of these transcription files started somewhat similar to
  that of the EVMT interlinear. The numbering skipped the "titles". It
  was then converted to the new IVTFF format, mostly.

  In particular, for this conversion the following line numbers had to
  be changed:

    f103v: 28a,29-36,36a,37-44 to 29-46.
    f108v: 24a,25-51 to 25-52.
    f112v: 44a,45-47 to 45-48.
    f115r: 36a,37-44 to 37-45.

  Also, the existing transcriptions of the SPS have four "titles", short
  lines with anomalous justification:

    =sairy.ore.daiindy.ytam=
    =otoiis.chedaiin.otair.otaly=
    =olchar.olchedy.lshy.otedy=
    =ytain.olkaiin.ykar.chdar.alkam=

  The "title" was assumed to be part of the following line that the
  Scribe skipped at first and then added above that line. In the
  conversion, it was appended to line . Line was set to empty (actually,
  with the data field just "="). The other three "titles" were kept as
  such. They must be excluded by special tests when analyzing paragraphs.

INITIAL STARLET NUMBERS

  Initial stab at inserting star numbers "" and " in parag head lines:

    ./replace_star_ids.sh < starps-Z.eva > .temp-Z.eva

  Edited the file "starps-Z.eva" checking and reassigning all parag
  breaks.
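  Returning to the line renumbering listed in the history section, the
  four changes can be sanity-checked mechanically. The sketch below is
  only an illustration, not part of the actual conversion scripts: the
  function names are hypothetical, and it assumes that each old label
  (including the "a" insertions) maps, in order, to the next consecutive
  new number.

```python
def expand_labels(spec):
    """Expand a spec like "28a,29-36,36a,37-44" into the list of old labels."""
    labels = []
    for part in spec.split(","):
        if "-" in part:
            lo, hi = part.split("-")
            labels.extend(str(n) for n in range(int(lo), int(hi) + 1))
        else:
            labels.append(part)
    return labels


def renumber(spec, first_new):
    """Pair each old label, in order, with consecutive new line numbers.

    Assumption: the new numbering is simply sequential from first_new.
    """
    old = expand_labels(spec)
    return list(zip(old, range(first_new, first_new + len(old))))


# Old label spec, first new number, last new number, as listed in the notes.
CHANGES = {
    "f103v": ("28a,29-36,36a,37-44", 29, 46),
    "f108v": ("24a,25-51", 25, 52),
    "f112v": ("44a,45-47", 45, 48),
    "f115r": ("36a,37-44", 37, 45),
}

for page, (spec, first, last) in CHANGES.items():
    pairs = renumber(spec, first)
    # The last old line must land exactly on the stated last new number.
    assert pairs[-1][1] == last, page
    print(page, len(pairs), "lines renumbered")
```

  Under that assumption, the number of old labels on each page agrees
  with the size of the stated new range.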
UPDATING THE LOCATORS

  A major step in converting the various files to the IVTFF format was
  replacing the old-style EVMT 1.6e6 locators
  <{PAGE}.{UNIT}.{OSEQ};{TRANS}> by the new-style IVTFF locators
  <{PAGE}.{NSEQ};{TRANS}>:

    now=`yyyy-mm-dd-hhmmss`
    mkdir -p SAVE/${now}
    for ifile in starps-{U,H,Z} ; do
      chmod a-w ${ifile}.eva
      mv -vi ${ifile}.eva SAVE/${now}/
    done

  The "-Z" version had already been upgraded, so:

    cp -av SAVE/${now}/starps-Z.eva ./

  As for the other two:

    ln -s ../074/map_locators.sh
    ln -s ../074/loci-evmt16e6-ivtff.tbl
    for ifile in starps-{U,H} ; do
      cat SAVE/${now}/${ifile}.eva | map_locators.sh > ${ifile}.eva
    done

COMPARING VERSIONS

  Wrote a python3 program "compare_ivtff_files.py" to compare two files,
  line by line, using an optimal alignment algorithm:

    make -f compare_ivtff_files.make

    tag0="Z"
    for tag1 in U H ; do
      file0="starps-Z.eva"
      file1="starps-${tag1}.eva"
      ofile=".cmp-${tag0}${tag1}.edf"
      ./compare_ivtff_files.py ${file0} ${file1} > ${ofile}
    done

  First run:

    read 2414 lines from file 0 = starps-Z.eva
    read 1313 lines from file 1 = starps-U.eva
    there were 587 loci from file0 missing in file1

    read 2414 lines from file 0 = starps-Z.eva
    read 1655 lines from file 1 = starps-H.eva
    there were 1 loci from file0 missing in file1

  Edited the files starps-Z.eva and starps-U.eva until the latter became
  a subset of the former (apart from comments).
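  The subset condition above can be illustrated with a minimal sketch.
  This is not the actual "compare_ivtff_files.py" (which aligns the text
  of the lines too); it only collects loci and reports those present in
  one file but not in the other. The regular expression assumes the
  simple locator shape <{PAGE}.{NSEQ};{TRANS}> described above, the
  function names are hypothetical, and the sample lines are made up.

```python
import re

# Assumed locator shape at the start of a data line: <PAGE.NSEQ;TRANS>
LOCATOR = re.compile(r"^<([^.<>;]+)\.([^;<>]+);([^<>]+)>")


def read_loci(lines):
    """Return the set of (page, nseq) pairs found on data lines."""
    loci = set()
    for line in lines:
        if line.startswith("#"):
            continue  # Comment lines are ignored in the comparison.
        m = LOCATOR.match(line)
        if m:
            loci.add((m.group(1), m.group(2)))
    return loci


def missing_loci(lines0, lines1):
    """Return (loci in file 0 but not in file 1, loci in file 1 but not in file 0)."""
    loci0 = read_loci(lines0)
    loci1 = read_loci(lines1)
    return sorted(loci0 - loci1), sorted(loci1 - loci0)


# Made-up sample data, for illustration only.
file0 = [
    "# a comment",
    "<f103r.1;Z>  xxx.yyy",
    "<f103r.2;Z>  zzz.www",
]
file1 = [
    "<f103r.1;U>  xxx.yyy",
]

only0, only1 = missing_loci(file0, file1)
print(len(only0), "loci from file0 missing in file1")
print(len(only1), "loci from file1 missing in file0")
```

  File 1 is a subset of file 0, in the sense used here, when the second
  list is empty.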
  Final run:

    # read 2421 lines ( 1064 data, 23 pages) from file 0 = starps-Z.eva
    # read 921 lines ( 476 data, 23 pages) from file 1 = starps-U.eva
    # 588 loci from file0 missing in file1
    # 476 perfectly matching line pairs
    # 0 imperfectly matching line pairs

  Saving the current files:

    now="`yyyy-mm-dd-hhmmss`"; echo "now = ${now}"
    mkdir -p SAVE/${now}
    mv -vi starps-U.eva SAVE/${now}
    cp -av starps-Z.eva SAVE/${now}/starps-Z-actually-U.eva
    chmod a-w SAVE/${now}/*.eva

    now = 2025-07-15-200047
    renamed 'starps-U.eva' -> 'SAVE/2025-07-15-200047/starps-U.eva'
    'starps-Z.eva' -> 'SAVE/2025-07-15-200047/starps-Z-actually-U.eva'

  Renaming "starps-Z.eva" as "starps-U.evt" to reflect the true culprit
  and uniformize with note 074. Renaming "starps-H.eva" to "starps-H.evt"
  for the same reason:

    mv -vi starps-Z.eva starps-U.evt
    mv -vi starps-H.eva starps-H.evt
    chmod u+w starps-{U,H}.evt

  Replacing ";Z" by ";U" with emacs.

MOVING THE FILE

  Moving starps-U.evt to note 074 since further editing will take place
  there:

    mv -vi starps-U.evt ../074/star25e1.evt
    ln -s ../074/star25e1.evt starps-U.evt

RESTORING PAGE HEADER LINES

  The page header lines were restored in "starps-U.evt".

PARAGRAPH STATISTICS

    compute_H_U_parag_stats.sh