CryptologyEmpirical Values for Natural Languages |
|
Here are some additional explicit examples.
These examples show some trends:
[For solid statistical inferences of course we should take much more samples. See the next subsections.]
The Polish cryptanalyst Rejewski was the first who successfully broke early versions of the German cipher machine Enigma, see »Cryptanalysis of rotor machines«. He detected that ciphertexts were »in phase« by coincidence counts. It is unknown whether he knew Friedman's approach, or whether he found it for himself.
Friedman's early publications were not classified and published even in France.For example he noted that the two ciphertexts
RFOWL DOCAI HWBGX EMPTO BTVGG INFGR OJVDD ZLUWS JURNK KTEHMbesides having the initial six letters identical also had an increased coincidence between the remaining 44 letters.
RFOWL DNWEL SCAPX OAZYB BYZRG GCJDX NGDFE MJUPI MJVPI TKELY
Exercise: How many coincidences would you expect for independent texts?
Rejewski concluded that the first six letters denoted a »message key« that was identical for the two messages, and from this that the Enigma operators prefixed their messages by a six letter message key. (Later on he even detected that in fact they used a repeated three letter key.)
[Source: F. L. Bauer: Mathematik besiegte in Polen die unvernünftig gebrauchte ENIGMA. Informatik Spektrum 1. Dezember 2005, 493 -497.]