100 Years of Cryptanalysis: Compositions of Permutations (Non-Commutative) 1918 => Modern Block Ciphers!

100 years of Cryptanalysis: Compositions of Permutations (Non-Commutative) 1918 => Modern Block Ciphers!

complicated?

Nicolas T. Courtois University College London, UK or How Some Rejewski Mathematicians Turing Won the WW2…

Zygalski Welchman Code Breakers Encryption

ciphertext

plaintext

-self-reciprocicity = involution pty 3 Nicolas T. Courtois, 2012 -no letter encrypted to itself GOST, Self-Similarity and Cryptanalysis of Block Ciphers Think Inside the Box

I = inputs

Involution F(x)=y key???  ????? F(y)=x

O =outputs

4 © Nicolas T. Courtois, 2006-2013 GOST, Self-Similarity and Cryptanalysis of Block Ciphers The Box - General Setting

I = inputs R/W

????? data??? key???

R/W O =outputs

5 © Nicolas T. Courtois, 2006-2013 GOST, Self-Similarity and Cryptanalysis of Block Ciphers Algebraic Cryptanalysis [Shannon] Breaking a « good » cipher should require:

“as much work as solving a system of simultaneous equations in a large number of unknowns of a complex type”

[Shannon, 1949]

6 © Nicolas T. Courtois, 2006-2013 GOST, Self-Similarity and Cryptanalysis of Block Ciphers Bottom Line From FIRST cipher machines in 1920s to todays’ block ciphers, cryptanalysis has NOT changed so much (!!).

Same guiding principles.

7 © Nicolas T. Courtois, 2006-2013 GOST, Self-Similarity and Cryptanalysis of Block Ciphers Motivation Linear and differential cryptanalysis usually require huge quantities of known/chosen plaintexts.

Q: What kind of cryptanalysis is possible when the attacker has only one known plaintext (or very few) ?

=real world LOW DATA CRYPTANALYSIS attacks

8 © Nicolas T. Courtois, 2006-2013 Code Breakers Marian Rejewski December 1932: reverse engineering of Enigma rotors

– “the greatest breakthrough in cryptanalysis in a thousand years” [David Kahn]

– cf. John Lawrence, "A Study of Rejewski's Equations", Cryptologia, 29 (3), July 2005, pp. 233–247. + other papers by the same author

9 Nicolas T. Courtois, 2012 Code Breakers

Rotors 26 relative settings

Difficult to obtain for the enemy…

10 Nicolas T. Courtois, 2012 Code Breakers Commercial Enigma [1920s] insecure

combines several permutations on 26 characters…

11 Nicolas T. Courtois, 2012 Code Breakers Rotor Stepping Regular

odometer-like

12 Nicolas T. Courtois, 2012 Code Breakers *Rotor Stepping Regular

period=?

13 Nicolas T. Courtois, 2012 Code Breakers *Rotor Stepping Regular

period=263 or actually 262*25?

14 Nicolas T. Courtois, 2012 Code Breakers Rotor Stepping

• Rotating a rotor: N • N becomes C-1○N○C (p) • C is a circular shift a↦b…

• Proof:

15 Nicolas T. Courtois, 2012 Code Breakers Bâtons/Rods Attack Used by French/British/Germans to break Swiss/Spanish/Italian/British ciphers in the 1930s… • assumes only first rotor moving • rotor wiring known • guess which rotor is at right p • guess starting position (26) • guess SHORT crib [plaintext] • t=0 c=N-1 ○ Z ○ N (p) • t=0 Z ○ N(p) = N(c) N • Z is an involution

• Rotating a rotors: c • P becomes C-1○P○C (p) • C is a circular shift a↦b…

• t=i Z ○ C-iNCi (p) = C-iNCi (c) Z

•attack worked until 1939 [cf. Spanish civil war] 16 Nicolas T. Courtois, 2012 •Germans: avoid the attack since 1929/30 with a steckerboard Code Breakers Bâtons/Rods Attack Example: i=0 -guess crib = 14 letters plain -guess which rotor is rightmost N(p) -check ALL 26 starting position pairs for Z obtained: N(c)

210

17 Nicolas T. Courtois, 2012 GOST, Self-Similarity and Cryptanalysis of Block Ciphers Think Inside the Box

I = inputs

rightmost rotor position

Involution confirms the data??? key??? key choice ?????

like OK or “SAT”

rightmost rotor position

O =outputs

18 © Nicolas T. Courtois, 2006-2013 GOST, Self-Similarity and Cryptanalysis of Block Ciphers the Box – General Setting

I = inputs R/W

????? data??? key???

R/W O =outputs

19 © Nicolas T. Courtois, 2006-2013 GOST, Self-Similarity and Cryptanalysis of Block Ciphers *Modern Method - Contradictions - SAT Solvers -

20 © Nicolas T. Courtois, 2006-2013 GOST, Self-Similarity and Cryptanalysis of Block Ciphers SAT / UNSAT Immunity

See:

Nicolas Courtois, Jerzy A. Gawinecki, Guangyan Song: Contradiction Immunity and Guess-Then-Determine Attacks On GOST, in Tatracrypt 2012.

21 © Nicolas T. Courtois, 2006-2013 GOST, Self-Similarity and Cryptanalysis of Block Ciphers UNSAT Immunity – Block Ciphers Guess 78 bits => Contradiction with SAT solver software 50 %of the time

We say that for 8 rounds of GOST the UNSAT Immunity is at most 78 [Tatracrypt 2012]

22 © Nicolas T. Courtois, 2006-2013 GOST, Self-Similarity and Cryptanalysis of Block Ciphers Guess Then Eliminate Depth-First Tree Search.

78 78 78 bits bits bits X X

good

23 © Nicolas T. Courtois, 2006-2013 GOST, Self-Similarity and Cryptanalysis of Block Ciphers SAT Immunity – 4 pairs Guess these 68 bits.

=> all the other bits?

24 © Nicolas T. Courtois, 2006-2013 GOST, Self-Similarity and Cryptanalysis of Block Ciphers SAT Immunity – 4 pairs Guess these 68 bits.

=> all the other bits are found in 400 s on one laptop i7 CPU  using CryptoMiniSat x64 2.92.

Corollary: Given 4KP for 8R we determine all the key bits in time 294. [Courtois Cryptologia vol 37, 2013]

25 © Nicolas T. Courtois, 2006-2013 Code Breakers Military Enigma [1930s]

stecker= plugboard

Added in 6/1930:

26 Nicolas T. Courtois, 2012 Code Breakers Stecker Huge challenge for code breakers

*common point in all good Enigma attacks: eliminate the stecker, “chaining techniques”… also for Abwehr

27 Nicolas T. Courtois, 2012 Code Breakers •6 plugs until Nov 1937 Stecker •variable 5-8 plugs… S=involution, •10 plugs Nov 1939=>most of the war 6 fixed points

2 holes/letter no plug => E->E etc.

28 Nicolas T. Courtois, 2012 Code Breakers Military Enigma

involution, 13 pairs picture by D. Davies

29 Nicolas T. Courtois, 2012 Code Breakers Key Size About 2380 with rotors Only 276 when rotors are known. 5 main rotors were found by Polish mathematicians before WW2 started.

Same 3 rotors used since 1920s… until 1945!!! BIG MISTAKE.

30 Nicolas T. Courtois, 2012 Code Breakers

Part 1

Permutations non-commutative PoQ ≠ QoP

31 Nicolas T. Courtois, 2012 Code Breakers

INI Methods

32 Nicolas T. Courtois, 2012 Code Breakers Two Main Families of Machines • Self-reciprocical = involution, • e.g. Enigma • E/D switch: • e.g. Fialka, KL7, Typex…

keys, settings, E/INI etc…

OUT

33 Nicolas T. Courtois, 2012 Code Breakers Message key Message key=session key=ephemeral key Should never repeat for two different messages, makes encryption probabilistic Transmitted to the receiver encrypted (E), must be decrypted (D) by the receiver.

message key message daily key = header daily setting E/INI transmitted (24 h)

34 Nicolas T. Courtois, 2012 Code Breakers

1.1.

Enigma INI

35 Nicolas T. Courtois, 2012 Code Breakers

History of Enigma Initialization – 3 Periods

6 digits header = E(session key) Method 1 – 2 Mistakes encryption done twice, lots of data with one « daily key » • 15 Sept 1938 9 digits header = Method 2 - 1 Mistake twice E(session key) with a random only 6 chars with the same key! • 1 May 1940 Method 3 - 0 Mistakes 6 digits header no more repeated encryption

36 Nicolas T. Courtois, 2012 Code Breakers

*Method 1 – 2 Mistakes

Except… before 15 Sept 1938 •the intelligence agency of the SS •and the Nazi Party did not make the change until 1 July 1939.

37 Code Breakers *Additional Help: Message Keys NOT Random

(should be 3 random letters)

AAA XYZ ASD QAY

Operators always found a way to «degrade » their security

38 Code Breakers

Method 1 – 2 Mistakes

before 15 Sept 1938 encrypted

3 digit « random » message key lots of redundant repeat twice data - same 3 3 daily key! daily settings: -rotors I III IV -ring settings 3 3 6-digit E header -start position 3 « fixed secret IV » shared 39 Nicolas T. Courtois, 2012 Code Breakers

Method 2 – 1 Mistake

15 Sept 1938 - 1 May 1940 [sometimes also used later, e.g. Norway, Malta 1942]

3 digit « random » message key only 6 digits with the same repeat twice random start 3 3 position daily settings: 3 -rotors I III IV 9-digit -ring settings 3 3 E header -random start 3 «random IV »

40 Nicolas T. Courtois, 2012 Code Breakers

Method 3 – 0 Mistakes

after 1 May 1940

3 digit « random » message key perfectly randomised once method only 3 digits 3 with a random start daily settings: position 6-digit -rotors I III IV 3 header -ring settings E 3 -random start 3

41 Nicolas T. Courtois, 2012 Code Breakers

Part 3

Polish Attacks

Rejewski Zygalski

42 Nicolas T. Courtois, 2012 Code Breakers

Three Periods in the History of Enigma

Early Polish Methods 2 Mistakes encryption done twice, lots of data with one « daily key » • 15 Sept 1938 Zygalski Method 1 Mistake implemented/used at BP • 1 May 1940 0 Mistakes [Herivel Attack] Turing-Welchman Bombes

43 Nicolas T. Courtois, 2012 Code Breakers

Method 2 – 1 Mistake

15 Sept 1938 - 1 May 1940 [sometimes also used later, e.g. Norway, Malta 1942]

3 digit « random » message key only 6 digits with the same xyzxyz random start 3 3 position daily settings: 3 -rotors I III IV 9-digit -ring settings 3 3 E header -random start 3 «random IV »

44 Nicolas T. Courtois, 2012 Code Breakers

focus on repeated indicator:

15 Sept 1938 - 1 May 1940 [sometimes also used later, e.g. Norway, Malta 1942]

3 digit « random » message key

xyzxyz our 6 digits ciphertext 3 3 daily settings: - header rotors I III IV 3 c d e -ring settings 3 3 E c’d’e’ -random complex 3 «randomkey for IV the» whole machine (arguably not very useful) 45 Code Breakers Key Principle in Lots of Enigma Attacks

same x

At steps 1 and 4 s T=1 R’1 (x) = c T=4 R’ (x) = c’ t x 4 e  the attacker can OBTAIN pairs for: c -1 k R’4 ○ R’1 c ↦c’ e c r IMPORTANT: R’4 is an involution => We get to know pairs for a special permutation R’1 ○ R’4 R’i 46 Nicolas T. Courtois, 2012 Code Breakers Magic = Permutation Factoring!

At steps 1 and 4 s T=1 R’1 (x) = c T=4 R’ (x) = c’ t x 4 e  the attacker can OBTAIN pairs for: c -1 k R’4 ○ R’1 e c BOTH are involutions r => we CAN recover BOTH by factoring R’1 ○ R’4 [due to Rejewski Theorem, they map cycles to identical cycles, cf. slide 138] Lemma: requires 74 events on average canR’ be recovered!i 47 Nicolas T. Courtois, 2012 Code Breakers ***inside we have

same x

At steps 1 and 4 -1 s T=1 S ○ R1 ○ S(x) = c T=4 S-1 ○ R ○ S(x) = c’ t x 4 e  the attacker can RECOVER the combination of these 2 involutions N Sc -1 -1 -1 k S ○ R4 ○ S ○ S ○ R1 ○ S c ↦c’ e c r

-1 R4 ○ R1 S(c) ↦S(c’) Ri Q the same, high prob >≈ 0.75, no movement 48 Nicolas T. Courtois, 2012 Code Breakers Do We Have Enough Data ≥ 74?

same x

At steps 1 and 4 s T=1 R’1 (x) = c T=4 R’ (x) = c’ t x 4 e  the attacker can RECOVER -1 c both R’4 and R’1. k before 1938, 2 mistakes, -1 e c R4 ○ R1 was fixed in all messages in 1 week or so… r  74 samples => recover R’i by factoring… => recover all rotors and break keys… Sept 1938 - 1 May 1940, 1 mistake, -1 R4 ○ R1 was different in each message, cf. Zygalski attack  could be observed only once  attacker can see when it has a fixed point = so called ‘females’ [from Polish/English R’i pun same/samica]. 49 Nicolas T. Courtois, 2012 Code Breakers

Early Polish Methods: before Sept 1938

Cyclometer

Making a catalogue for breaking Enigma…

Key task for Rejewski…

50 Nicolas T. Courtois, 2012 Code Breakers

15 Sept 1938

Panic in Polish Cipher Bureau!

2 new methods were invented: Polish Bombs [pbs with stecker 6=>more] Zygalski sheets [BEST, eliminates the stecker totally => massively used during WW2]

51 Nicolas T. Courtois, 2012 Code Breakers

Second Generation Enigma Attacks

Rejewski Zygalski

52 Nicolas T. Courtois, 2012 GOST, Self-Similarity and Cryptanalysis of Block Ciphers Conjugation

“Theorem Which Won World War 2”, [I. J. Good and Cipher A. Deavours, afterword to: Marian Rejewski, "How Polish Mathematicians Deciphered the Enigma", Annals of the History of Computing, 3 (3), July 1981, 229-232] P and Q-1 o P o Q have the same cycle structure

53 © Nicolas T. Courtois, 2006-2013 Code Breakers

*Polish Bombe: worked until 1 May 1940

Short cycles… assumed stecker not active. required MANY messages from the same setting…

-1 R’4 ○ R’1 c ↦c’ 54 Nicolas T. Courtois, 2012 Code Breakers