0% found this document useful (0 votes)

17 views9 pages

Lecture 4 - CS50's Computer Science For Lawyers

Uploaded by

Joyden

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

17 views9 pages

Lecture 4 - CS50's Computer Science For Lawyers

Uploaded by

Joyden

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 9

CS50’s Computer Science for Lawyers

OpenCourseWare

Donate  (https://cs50.harvard.edu/donate)

Doug Lloyd (http://douglloyd.com/), J.D.

[email protected]

David J. Malan (https://cs.harvard.edu/malan/)

[email protected]
 (https://www.facebook.com/dmalan)  (https://github.com/dmalan) 
(https://www.instagram.com/davidjmalan/)  (https://www.linkedin.com/in/malan/) 
(https://www.reddit.com/user/davidjmalan)  (https://www.threads.net/@davidjmalan) 
(https://twitter.com/davidjmalan)

Lecture 4
Overview
Ciphers
Substitution Ciphers
Caesar Cipher
Vigenere Cipher
Frequency Analysis
Other Ciphers
Hashes
Passwords
Hash Functions
Modern Cryptography
SHA-1
Public Key Cryptography
Digital Signatures
Blockchain
Structure
Proof of Work

Overview
 Cryptography is the art and science of obscuring and protecting information.
 We use cryptography to provide a level of security against an adversary who might do bad things with
the information.

Ciphers
 Ciphers are algorithms (a set of step-by-step instructions) used to obscure (or encipher) or reveal (or
decipher) information.

Substitution Ciphers
 A common substitution cipher that we might have seen is pictured here:

 In this cipher, each letter corresponds to a number.

 The attack vector in this cipher, or the vulnerability in this cipher, lies in the pin. Anyone with access
to this pin will be able to break any cipher generated with this pin.
 In this case, the decoder pin is considered the key.

Caesar Cipher
 We can use the ordinal positions of letters in a cipher to generate this key:

A B C D … Y Z

1 2 3 4 … 25 26

 We can also rotate the starting point. If we add 2 to every number, we might use this key:

A B C D … Y Z

3 4 5 6 … 27 28

 Seeing a 27 and 28, though, might be a tipoff that the cipher was shifted in some way. To fix this, we
might want to wrap back to 1 for numbers over 26. We might use this key instead:

A B C D … Y Z

3 4 5 6 … 1 2

 There are only 26 ways to rotate this cipher while preserving the order of the letters. In other words,
we have a very small number of keys that can be used to decipher using this cipher.

Vigenere Cipher
 An improvement we can make to the Caesar cipher is to increase the number of keys.
 While the Caesar cipher uses a single key, the Vigenere cipher uses multiple keys by selecting a
keyword.
 In the Vigenere cipher, for each new letter of message, it is enciphered using a different letter of the
keyword.
 To encrypt the message HELLO using the keyword LAW , we might come up with the following table:

Plaintext H E L L O

Ordinal Position 8 5 12 12 15

Keyword L A W L A

Keyword Ordinal Position 12 1 23 12 1

Sum 20 6 35 24 16

Sum, Wrapping Around 20 6 9 24 16

Ciphertext T F I X P

 For the first letter, or H , the ordinal position is 8. The first letter of our keyword is L , which has
position 12. We add these to positions to get 20, and the letter at position 20 is T . Thus, the
first letter of our cipher text is T .
 When we run out of letters in our keyword, we can re-use our keyword. Note that in the
Keyword row, LAW is repeatededly used.

 Note that the two L s in HELLO mapped to different letters in the ciphertext, namely I and
X . This is different from the Caesar cipher, where the L s would map to the same letter. For
example, if we shifted by 2, all L s would map to N .
 Instead of just 26 possible keywords, we now have 26n possible keywords, where n is the number of
letters in the keyword.

Frequency Analysis
 Another issue with Caesar ciphers is that an adversary may be able to crack the code without a pin.
 For example, if we see a single letter word in the message, we might be able to guess that the
character or number represents I or A . From there, we might be able to discover some patterns in
the message.
 A pattern may be how frequently letters appear in the English language.

A B C D E F G H I J K M

8.1% 1.5% 2.8% 4.3% 12.7% 2.2% 2.0% 6.1% 7.0% 0.2% 0.8% 4.0%

N O P Q R S T U V W X Y

6.7% 7.5% 1.9% 0.1% 6.0% 6.3% 9.1% 2.8% 1.0% 2.4% 0.2% 2.0%

 Some letters appear very frequently, such as E or T and some letters appear very infrequently, such
as J or K . Using these frequencies, we can look at what appears frequently or infrequently in the
ciphertext and perhaps find certain patterns.
 While for humans it might be tedious to conduct frequency analysis to decode a message, a computer
can do it very quickly.

Other Ciphers
 Some ciphers substitute pairs or triples of characters at a time, which is more secure than mapping
one-to-one as previously.
 Some ciphers are transposition ciphers, which algorithmically rearrange the letters in a message.
 The issue with these ciphers is that they’re easily cracked. Furthermore, we must be able to tell an
ally what the key is without telling an adversary.

Hashes
 Generally, while ciphers are reversible, hashes are not.
 To hash some data, we run it through a hash function, which mathematically manipulates it in some
way, and it outputs a value.

Passwords
 A good service does not store passwords in their database. Instead, they store a hash of the password.
 When you input a password, they’ll run the password through the hash function and check whether it
matches the hash stored.
 If the hashes match, odds are the right password was entered.

Hash Functions
 Good hash functions should have the following qualities:
 Use only the data being hashed
 Use all of the data being hashed
 Be deterministic (for the same input, we should always get the same output)
 Uniformly distribute data (when mapping strings to values, strings should map evenly to all
possible values)
 Generate very different hash codes for very similar data
 An example hash function (albeit a bad one) might be to add up the ordinal positions of all the
letters in the hashed string.
 STAR would map to 58.

 ARTS , RATS , SWAP , PAWS , WASP , MULL , BBBBBBBBBBBBBBBBBBBBBBBBBBBBB would also all
map to 58.
 This hash function is good in that it is not reversible. Having the value 58 does not necessarily
tell us that the word was STAR .
 This hash function, however, is bad in that it has a lot of collisions. In cryptography, we’d like to
have no collisions at all.

Modern Cryptography
 Modern cryptography is hashing, and the algorithms tend to work on clusters of characters rather
than going character-by-character, as we did earlier.
 Given data (files, images, videos, etc) of arbitrary sizes, most modern hash functions will map it to a
string of bits that is always the exact same size.
 Good cryptographic hash functions, usually referred to as digests should have the following qualities:
 Be extremely difficult to reverse
 Be deterministic
 Generate very different hash codes for very similar data
 Never allow two different sets of data to hash to the same value
 We use cryptography every day on the internet, whether it’s for…
 Email: encrypting emails from point to point
 Secure web browsing: encrypting web traffic
 VPN: encrypting communications with a network
 Document storage: encrypting pieces of files

SHA-1
 SHA-1 is a famous crytographic hash function first developed by the NSA in the mid-1900s.
 SHA-1 digests are always 160 bits in length, meaning there are 2160, approximately 1048, possible
SHA-1 digests. The probability of a collision wih SHA-1 is equivalent to finding 1 specific grain of sand
on 8 quintillion Earths!
 SHA-1 is now broken—a research team has come up with a way to systematically generate collisions.
For more information, visit shattered.io (https://shattered.io/).
 Since it is now feasible to generate PDFs with the same digital signature, it is possible to create
a valid signature for a high-rent contract by having someone sign a low-rent contract.
 Many other cryptographic standards are in use as well.
 SHA-2 and SHA-3 use more bits in their digest.

 MD5 (no longer secure but used as a checksum) and MD6

Public Key Cryptography

 In public key cryptography, also called asymmetric encryption, there are two functions f(n) and
g(n) , each of which is a one-way function that reverses each other.

 One of those functions is public and anyone can use it to encrupt information intended for you. The
other is private, known only to you, and can be used to reverse the encryption of the first.
 For example, for f(n) , we might have f(n) = n x 8 . Then, f(14) = 112 . This might be our
public function, where if anyone wants to send us a message, they will plug their data into f(n) and
send us that encrypted message.
 Note that an adversary would not be able to undo this function. For example, f(n) = 10 x n
- 28 would also give us f(14) = 112 .

 Generally, to generate these keys, we start with a really large, normally prime, randomly-generated
number. From there, two complementary one-way functions (quite a bit more complicated than our
f(n) ) are generated to create a public-private key pair.

 These keys are typically generated by a program, such as RSA.

 The general flow of asymmetric encryption is shown here:

Digital Signatures
 Digital signatures are almost the inverse of encryption.
 Using a digital signature, one can verify the authenticity of the sender of a document.
 Digital signatures are usually 256 bits, meaning 2256 distinct digital signatures are possible.
 The general flow of verifying a digital signature is shown here:
 The signing part of the process includes the following:
 Starting from the top left, at data, we have a file with a signature.
 We pass this file through a hash function (usually well known) and we get a hash.
 We encrypt the hash with a private key.
 We send the encrypted signature along with the original file.
 The verification part of the process includes the following:
 The digitally signed data includes the original file and the encrypted signature.
 We take the encrypted signature and use the signer’s public key, which is reciprocal of the
signer’s private key. After decrypting, we get a hash.
 We run the file through the same hash function that the signer used to get another hash.
 If these two hashes are equal, then the signature is very likely to be valid.

Blockchain
 Digital signatures and their ease of verification provide the basis for the blockchain.
 The most familiar use of blockchain is for cryptocurrency, such as Bitcoin.
 For a more in-depth mathematical discussion of Bloackchain and Bitcoin, check out 3Blue1Brown’s
video (https://www.youtube.com/watch?v=bBC-nXj3Ng4).

Structure
 The blockchain is essentially a linked list, or a chain of blocks. Instead of storing 3 values (previous
pointer, next pointer, and data), these blocks will store 4 values.
 The “data” in this case is a list of transactions, each transaction having a digital signature.
 For Bitcoin, transactions represent money being exchanged. However, the data doesn’t have to
be a list of transactions. It could be a digitally signed PDF scan of a contract between two
people, among many other possibilities.
 The “proof of work” allows Bitcoin to determine what the true set of transactions, or ledgers, are.
 The ledger is decentralized, meaning each time the data is recorded, everyone must record that
transaction on their own copy of the ledger, in that block.
 To determine which block is legitimate, since everyone has their own copy and could hypothetically
modify it, most cryptocurrencies assume that the chain with the most computational work put into it
is the “true” chain.

Proof of Work
 The proof of work allows us to determine which chain has had the most computational work put into
it.
 To prove a particular block, we hash the block over and over, coupled with some random number, until
we find a highly unusual pattern in the first n out of 256 bits.
 The proof of work is the number that we hash with the block to get that unique pattern.
 For example, we can take a block and hash it with 12345 . Suppose we get the unique pattern of
zeroes and ones.
 To verify this block, someone could simply hash the proof of work, that we announce, with the
block to find that unique pattern of zeroes and ones.
 This way, we can prevent a fraudulent transaction.
 Assume Person A is initiating a fraudulent transaction and announces to Person B that A will
pay B 100 dollars. A only announces this transaction to B and not to anyone else maintaining
their own copies of this blockchain.
 Person A must verify that transaction. Person A appends that transaction to their own copy of
the blockchain, and in order to prove that block, person A must find a number, that when hashed
with the entire block, produces the unique pattern before anyone else does.
 Additionally, other transactions are going into Person A’s ledger, and Person A will have to keep
proving their work over and over. This means that Person A must stay ahead and continuously
find the secret numbers that when hashed create that unique pattern. This is to ensure that
Person A’s blockchain is considered the correct set of transactions.
 Person A cannot keep up with this, and at some point, some other chain will be considered
valid.
 The smallest change in any of the transactions would produce a totally different hash, making that
block unverified.
 The longer a chain gets, the more likely it is that the chain consists of only verified transactions.

Unit 1 Notes
No ratings yet
Unit 1 Notes
55 pages
Lecture 07 Hash Function Updated
No ratings yet
Lecture 07 Hash Function Updated
106 pages
Complete MODULE 2
No ratings yet
Complete MODULE 2
17 pages
CNSL Oral Notes
No ratings yet
CNSL Oral Notes
45 pages
Knowledge Discovery in Database
No ratings yet
Knowledge Discovery in Database
10 pages
Unit III
No ratings yet
Unit III
12 pages
Css QB
No ratings yet
Css QB
17 pages
Classical Ciphers
No ratings yet
Classical Ciphers
96 pages
Very Short Notes of Dbms RGPV (CS502)
No ratings yet
Very Short Notes of Dbms RGPV (CS502)
17 pages
Analytical Models in Parallel Computing
No ratings yet
Analytical Models in Parallel Computing
82 pages
Wk4 1 Hash
No ratings yet
Wk4 1 Hash
8 pages
Quiz2 Fall22
No ratings yet
Quiz2 Fall22
4 pages
Lab 1
No ratings yet
Lab 1
10 pages
Orchidea Configuration Guide
No ratings yet
Orchidea Configuration Guide
4 pages
Random Matrix Theory: Wigner-Dyson Statistics and Beyond. Lecture Notes Given at SISSA (Trieste, Italy)
No ratings yet
Random Matrix Theory: Wigner-Dyson Statistics and Beyond. Lecture Notes Given at SISSA (Trieste, Italy)
29 pages
Study Notes Cryptography 2022
No ratings yet
Study Notes Cryptography 2022
71 pages
Gauge R&R
No ratings yet
Gauge R&R
7 pages
An Ensemble Method For Phishing Websites Detection Based On XGBoost
No ratings yet
An Ensemble Method For Phishing Websites Detection Based On XGBoost
6 pages
Hashfuncs 6up
No ratings yet
Hashfuncs 6up
4 pages
Alzebra
No ratings yet
Alzebra
3 pages
CryptoHashRevised Handout 0325
No ratings yet
CryptoHashRevised Handout 0325
28 pages
Easttom PPT 08 Final
No ratings yet
Easttom PPT 08 Final
38 pages
T 1
No ratings yet
T 1
9 pages
Box-Packing Puzzles
No ratings yet
Box-Packing Puzzles
4 pages
Module 3-Tutorial Sheet
No ratings yet
Module 3-Tutorial Sheet
4 pages
Lecture 5 - Signature-Hash-MAC
No ratings yet
Lecture 5 - Signature-Hash-MAC
42 pages
Module 2 P
No ratings yet
Module 2 P
48 pages
Week 4
No ratings yet
Week 4
22 pages
Week 4
No ratings yet
Week 4
23 pages
Crypto Slides
No ratings yet
Crypto Slides
74 pages
Lecture 11
No ratings yet
Lecture 11
77 pages
Grade 10 Probability Worksheet
No ratings yet
Grade 10 Probability Worksheet
4 pages
Assignment 6
No ratings yet
Assignment 6
9 pages
Unit 3:group B Test 9-Klasse
No ratings yet
Unit 3:group B Test 9-Klasse
3 pages
Assignment6 Cse330 Spring 2024
No ratings yet
Assignment6 Cse330 Spring 2024
1 page
Lecture 2. Cryptography PDF
No ratings yet
Lecture 2. Cryptography PDF
33 pages
Introcryptography 1
No ratings yet
Introcryptography 1
49 pages
Security Chapter 3
No ratings yet
Security Chapter 3
60 pages
CH02
No ratings yet
CH02
32 pages
Crypto Summary
No ratings yet
Crypto Summary
76 pages
Cryptographic Hash Functions Guide
No ratings yet
Cryptographic Hash Functions Guide
43 pages
Lecture 9
No ratings yet
Lecture 9
30 pages
Modelling Mechanisms For Measurable and Detection Based On Artificial Intelligence
No ratings yet
Modelling Mechanisms For Measurable and Detection Based On Artificial Intelligence
6 pages
Cybersecurity Exercises Guide
No ratings yet
Cybersecurity Exercises Guide
47 pages
HAsh
No ratings yet
HAsh
18 pages
Chapter 5 Encryption
No ratings yet
Chapter 5 Encryption
37 pages
Cryptography
No ratings yet
Cryptography
75 pages
Cybersecurity: Cryptography Basics
No ratings yet
Cybersecurity: Cryptography Basics
33 pages
Digital - Chapter3.k Map
No ratings yet
Digital - Chapter3.k Map
18 pages
696643232M.A Session 1 Chapter 3
No ratings yet
696643232M.A Session 1 Chapter 3
2 pages
Cryptography I: General Concepts and Some Classical Ciphers
No ratings yet
Cryptography I: General Concepts and Some Classical Ciphers
60 pages
Lab01 - Classical Cryptography
No ratings yet
Lab01 - Classical Cryptography
10 pages
Applied Machine Learning For Engineers: Artificial Neural Networks
0% (1)
Applied Machine Learning For Engineers: Artificial Neural Networks
6 pages
1 Crypto
No ratings yet
1 Crypto
83 pages
Psi Lect Hashes 2 PPT
No ratings yet
Psi Lect Hashes 2 PPT
25 pages
Computer and Network Security: Modern Cryptography Overview: Kameswari Chebrolu
No ratings yet
Computer and Network Security: Modern Cryptography Overview: Kameswari Chebrolu
42 pages
Ecs726p Week04 P
No ratings yet
Ecs726p Week04 P
56 pages
Lec1 Intro
No ratings yet
Lec1 Intro
23 pages
Advanced Machine Learning Lab Syllabus
No ratings yet
Advanced Machine Learning Lab Syllabus
4 pages
Chapt-2 Crypthoprimers
No ratings yet
Chapt-2 Crypthoprimers
32 pages
Ccsb3113 - Chapter 11-12 Hash+Msg Authn
No ratings yet
Ccsb3113 - Chapter 11-12 Hash+Msg Authn
59 pages
09 Security
No ratings yet
09 Security
51 pages
Unit 2.1
No ratings yet
Unit 2.1
28 pages
1.1. Formulation of LPP: Chapter One: Linear Programming Problem/LPP
No ratings yet
1.1. Formulation of LPP: Chapter One: Linear Programming Problem/LPP
54 pages
Cryptographic Hash: MODULE 2 (Chapter 2)
No ratings yet
Cryptographic Hash: MODULE 2 (Chapter 2)
32 pages
The Lagrangian Relaxation Method For Solving Integer Programming Problems
No ratings yet
The Lagrangian Relaxation Method For Solving Integer Programming Problems
12 pages
Encryption Notes
No ratings yet
Encryption Notes
11 pages
ECS726-Week04 - Hash - MAC - Digital Sinatures - Freshness - Dynamic Password Schemes
No ratings yet
ECS726-Week04 - Hash - MAC - Digital Sinatures - Freshness - Dynamic Password Schemes
52 pages
IT Security - Hash
No ratings yet
IT Security - Hash
30 pages
Chapter3 PDF
No ratings yet
Chapter3 PDF
22 pages
Cost Estimation Methods Guide
No ratings yet
Cost Estimation Methods Guide
2 pages
Cryptographic Hash Functions
100% (1)
Cryptographic Hash Functions
31 pages
Multiplying Floating Point Numbers
No ratings yet
Multiplying Floating Point Numbers
8 pages
L3 (Keys) - Cyber Security
No ratings yet
L3 (Keys) - Cyber Security
48 pages
ch03 Crypto8e
No ratings yet
ch03 Crypto8e
34 pages
A Review On Image Retrieval Techniques
No ratings yet
A Review On Image Retrieval Techniques
4 pages
Quantitative Risk Analysis: TEJ PANDYA 80303160091
No ratings yet
Quantitative Risk Analysis: TEJ PANDYA 80303160091
5 pages
A Tableau-Based Theorem Proving Method For Intuitionistic Logic
No ratings yet
A Tableau-Based Theorem Proving Method For Intuitionistic Logic
8 pages
Hash Functions: EJ Jung
No ratings yet
Hash Functions: EJ Jung
15 pages
Crypto 2
No ratings yet
Crypto 2
10 pages
Hash Functions Cryptography Powerpoint Presentation Notes Security
No ratings yet
Hash Functions Cryptography Powerpoint Presentation Notes Security
31 pages
Prajwal. K
No ratings yet
Prajwal. K
31 pages
Cryptographic Hash Functions Guide
No ratings yet
Cryptographic Hash Functions Guide
54 pages
Security+ 4-2C All Slides
No ratings yet
Security+ 4-2C All Slides
652 pages
INS Unit - 4 1
No ratings yet
INS Unit - 4 1
30 pages
Cryptographic Hash Functions: Purpose
No ratings yet
Cryptographic Hash Functions: Purpose
20 pages
Notes 08 - Nichols Charts & Closed Loop Performance
No ratings yet
Notes 08 - Nichols Charts & Closed Loop Performance
3 pages
Cryptography and Network Security: Fifth Edition by William Stallings
No ratings yet
Cryptography and Network Security: Fifth Edition by William Stallings
23 pages
Cryptographic Hash Functions Guide
No ratings yet
Cryptographic Hash Functions Guide
25 pages

Lecture 4 - CS50's Computer Science For Lawyers

Uploaded by

Lecture 4 - CS50's Computer Science For Lawyers

Uploaded by

CS50’s Computer Science for Lawyers

Doug Lloyd (http://douglloyd.com/), J.D.

David J. Malan (https://cs.harvard.edu/malan/)

 In this cipher, each letter corresponds to a number.

Keyword Ordinal Position 12 1 23 12 1

Sum, Wrapping Around 20 6 9 24 16

 MD5 (no longer secure but used as a checksum) and MD6

Public Key Cryptography

 These keys are typically generated by a program, such as RSA.

You might also like