w7 Encrypted Search

Uploaded by

natalka.ciko


DS517 – Data Security

Lecture 6
Encrypted Search
Elias Athanasopoulos
Data and storage
• Data is collected by several applications or
services and it is further processed
– Data can be sensitive
• We store data in database management systems
(DBMSs)
– They support storing, searching and retrieval of data,
among others
• A database can be conceptualized as
– The core database that focuses on indexing/searching
– The DBMS, which is the software that performs data
accessing

DBMSs
• Essentially, they perform several actions
beyond storing and searching for data
– Enforcing data access policies
– Defining data structures
– Providing transaction guarantees
– Visualization and analytics
• We focus on the traditional database’s core
functions
– Data insertion, indexing, and search

Protected database search
• Use cryptography to separate the roles of providing,
administering, and accessing data
• The server does not see the plaintext of the stored data
– A breach of the server does not directly expose plaintext data
– The server performs operations on encrypted data, without
reading the plaintext data
• Wide variety of techniques
– Property-preserving encryption, searchable symmetric
encryption, private information retrieval by keyword,
oblivious RAM
• A protected search system must balance between
security, functionality, performance, and usability

Our goals
• Understand the current and future state of
database technology, enabling focus on
techniques that will be useful in future DBMSs
• Help security and database experts
understand the tradeoffs between protected
search systems so they can make an informed
decision about which technology, if any, is
most appropriate for their setting

Overview of database systems
• Relational databases (SQL)
– Strong transactional guarantees
– Vertically scalable: better performance through greater
hardware resources
– ACID (Atomicity, Consistency, Isolation, and Durability)
• NoSQL (not only SQL)
– Fast data insertion, flexible data structures, relaxed
transactional guarantees
– For large amounts of unstructured data
• NewSQL
– Scalability of NoSQL databases
– Transactional guarantees of relational databases
• Future systems

Query bases
• We define a small set of base operations that can
be combined to provide complex search
functionality
– Relational algebra (SQL): set union, set difference,
cartesian product (joins), projection and selection
– Associative arrays (key-value store for NoSQL):
construction, find, addition, element-wise
multiplication, array multiplication
– Linear algebra (NewSQL): construction, find, matrix
addition, matrix multiplication, element-wise
multiplication

Example systems

Database roles
• Provider
– Provides and modifies the data
• Querier
– Wishes to learn about the data
• Server
– Handles storage and processing
• Authorizer
– Specifies data- and query-based rules
• Enforcer
– Ensures the rules are applied

Database operations
Init/Query
• Init
– The initialization protocol occurs between the provider
and the server
– The server obtains a protected database representing the
loaded data
• Query
– The query protocol occurs between the querier (with a
query), the server (with the protected database), the
enforcer (with the rules), and possibly the provider
– The querier obtains the query results if the rules are
satisfied
• All systems we discuss support Init/Query

Database operations
Update/Refresh
• Update
– The update protocol occurs between the provider (with a set of
updates) and the server
– The server obtains an updated protected database
– Updates include insertions, deletions, and record modifications
• Refresh
– The refresh protocol occurs between the provider and the
server
– The server obtains a new protected database that represents
the same data but is designed to achieve better performance
and/or security
• Some of the systems we discuss support Update/Refresh

Protected database
search systems
• A system that supports the roles and operations, in which
each party learns only its intended outputs (informally)
• Ensures that the server learns nothing about the data
stored in the protected database or about the queries, and
the querier learns nothing beyond the query results
– Formalized using the real-ideal style of cryptographic definition
• Ideal
– A protected search system, in which a trusted external party
performs storage, queries, and modifications correctly and
reveals only the intended outputs to each party
• The real system is secure if no party can learn more from its
real world interactions than it can learn in the ideal system

Formal guarantees
• We focus on systems that provide formally
defined security guarantees
• Some other commercial systems are based on
cryptographic techniques with security proofs
– Further analysis needs to be conducted to
determine whether they deviate from the proven guarantees

Scenarios
• A few existing protected search systems consider the
enforcement of rules using an authorizer and enforcer
• Three-party scenario
– A provider, a querier, and a server
• Two-party scenario
– A single user (the client) acts as both the provider and the
querier
– E.g., a cloud-storage app in which a client uploads files to
the cloud that she can later search
– Client knows all information, so we consider security
against an adversarial server
• We focus on a single provider and single querier setting

Threats
• An adversary can be either an insider or an outsider
• Semi-honest adversary (honest-but-curious)
– They follow the protocol but may passively attempt to
learn additional information
• Malicious adversary
– May actively deviate from the protocol to learn secret information
• An adversary may be persistent for the lifetime of the
database or have access only to a snapshot
• Most common threat model
– Semi-honest, persistent, insider adversary

Performance and leakage
• Unprotected databases are I/O bound, while protected
databases are CPU/network bound
– Cryptography may be computationally heavy (asymmetric)
or relatively light (symmetric)
• A very slow system can be very secure but not
usable
– A faster system typically leaks more information
• Leakage profile
– A sequence of functions that formally describe all
information that is revealed to each party beyond the
intended output
– Can be complex

Common Leakage Profiles
• A leakage profile is composed of
– Objects that leak
– The type of information that is leaked
– Which operation leaks
– The party that learns the leakage

Common Leakage Profiles
Objects
• Objects vulnerable to leakage
– Data items and indexing data structures
– Queries
– Records returned as a response, or other
relationships between query and data
– Access-control rules

Common Leakage Profiles
Information that is leaked
• Structure
– Properties of an object only concealable via padding
– E.g., length of a string, the cardinality of a set
• Predicates
– Identifiers plus additional information on objects
– E.g., “matches the intersection of 2 clauses within a query”
and “within a common (known) range.”
• Equalities
– Which objects have the same value
• Order (or more)
– Numerical or lexicographic ordering of objects, or perhaps
even partial plaintext data
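
As an illustration of structure leakage, here is a minimal sketch of hiding string lengths by padding every value to a fixed bucket size before encryption. The function names and the 32-byte bucket are assumptions made for this example; the point is only that, without padding, a ciphertext's length usually tracks the plaintext's length.

```python
def pad_to_bucket(data: bytes, bucket: int = 32) -> bytes:
    """Pad data to the next multiple of `bucket` bytes (0x80 marker, then zeros)."""
    padded = data + b"\x80"
    padded += b"\x00" * (-len(padded) % bucket)
    return padded


def unpad(padded: bytes) -> bytes:
    """Remove the padding added by pad_to_bucket."""
    # The last 0x80 byte is always the marker, since only zeros follow it.
    return padded[: padded.rindex(b"\x80")]


names = [b"Li", b"Smith", b"Athanasopoulos"]

# Without padding, lengths differ and would leak through the ciphertext size.
raw_lengths = [len(n) for n in names]

# With padding, all three values fall in the same 32-byte bucket.
padded_lengths = [len(pad_to_bucket(n)) for n in names]

print(raw_lengths, padded_lengths)
```

Larger buckets hide more but waste storage and bandwidth, which is one instance of the security/performance balance discussed earlier.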

Common Leakage Profiles
Operation
• Init
– The server may learn about the initial data
• Query
– The querier may learn about the rules and the current data
– The server may learn about the query, the rules, and the
current data
– The provider may learn about the query and rules
– The enforcer may learn about the query and current data
• Update
– The server may learn about prior/new data records
• Refresh
– The server may learn about the current data

Protected search system
approaches
• Legacy
– The approach can be used with an unprotected
database server
• Custom index
– Based on special-purpose protected indices and
customized protocols
• Oblivious Index
– Subset of Custom index that, additionally,
obscures object identifiers

Base queries supported
• There are cryptographic protocols for
supporting base queries
– Equality, range, and Boolean queries
• Additional query types have been developed
– Denoted as “Other” in the final systemization

Performance and usability
• Scale
– The scale of updates and queries that each
scheme has been shown to support
• Crypto
– The type and amount of cryptography required to
support updates and queries
• Network
– The network latency and bandwidth
characteristics

Review of proposals
Legacy
• Property-preserving encryption allows operations
(e.g., equality or order) on ciphertexts that
preserve some property of the underlying
plaintexts
• Legacy databases can support those actions by
simply encrypting the data
– No other changes needed in the database
• Types of encryption
– Deterministic encryption (DET) for equality
– Order-preserving encryption (OPE) for range queries
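
The equality property of DET can be sketched as follows. This is not a real DET scheme: HMAC-SHA256 is used here as a stand-in deterministic keyed function (it produces a one-way tag, so values cannot be decrypted), and the key, column values, and function name are assumptions for illustration. It shows why an unmodified server can answer equality queries, and what it learns while doing so.

```python
import hashlib
import hmac
from collections import Counter

KEY = b"demo-key-not-for-production"  # hypothetical secret held by the client


def det_token(value: str, key: bytes = KEY) -> str:
    """Deterministic keyed tag: equal inputs always give equal outputs."""
    return hmac.new(key, value.encode(), hashlib.sha256).hexdigest()


# Client side: protect the column before loading it into a legacy database.
last_names = ["Smith", "Jones", "Smith", "Brown", "Smith"]
protected_column = [det_token(v) for v in last_names]

# Server side: an ordinary equality comparison on the protected values.
query = det_token("Smith")
matches = [i for i, t in enumerate(protected_column) if t == query]

# Leakage: the server sees which rows are equal, hence the frequency histogram.
histogram = sorted(Counter(protected_column).values(), reverse=True)

print(matches, histogram)
```

The histogram is visible in a single snapshot of the stored column, before any query is issued.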

Review of proposals
Custom inverted index
• Support for equality searches
– On single-table databases via a reverse lookup
that maps each keyword to a list of identifiers for
the database records containing the keyword
• Support for Boolean queries
– The inverted index finds the set of records
matching the first term in a query, and a second
index containing a list of (record identifier,
keyword) pairs is used to check whether the
remaining terms of the query are also satisfied
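
The two-index idea above can be sketched roughly as follows. This is a simplified illustration under stated assumptions, not a scheme from the literature: keywords are replaced by HMAC tokens, record identifiers are left in the clear (real schemes also protect the lists of identifiers), and all names are hypothetical.

```python
import hashlib
import hmac
from collections import defaultdict

KEY = b"demo-key-not-for-production"  # hypothetical secret shared by provider and querier


def token(keyword: str, key: bytes = KEY) -> bytes:
    """Keyed token that stands in for the keyword on the server."""
    return hmac.new(key, keyword.lower().encode(), hashlib.sha256).digest()


def build_index(records: dict) -> dict:
    """Init (provider side): map each keyword token to the ids of matching records."""
    index = defaultdict(list)
    for rec_id, text in records.items():
        for kw in set(text.split()):
            index[token(kw)].append(rec_id)
    return dict(index)


def build_pairs(records: dict) -> set:
    """Second index: the set of (record id, keyword token) pairs."""
    return {(rec_id, token(kw)) for rec_id, text in records.items() for kw in text.split()}


def search(index: dict, tok: bytes) -> list:
    """Equality query (server side): reverse lookup by token."""
    return sorted(index.get(tok, []))


def search_and(index: dict, pairs: set, toks: list) -> list:
    """Boolean AND: look up the first term, then check the rest against the pairs."""
    first, rest = toks[0], toks[1:]
    return [r for r in search(index, first) if all((r, t) in pairs for t in rest)]


records = {1: "alice smith", 2: "bob smith", 3: "alice jones"}
index = build_index(records)
pairs = build_pairs(records)

smith = search(index, token("smith"))
alice_smith = search_and(index, pairs, [token("smith"), token("alice")])
print(smith, alice_smith)
```

Putting the rarest term first keeps the candidate set small, which is one reason the ordering of terms within a query affects response time.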

Review of proposals
Custom tree traversal
• Based on indices with a tree-based structure
• A query is executed by traversing the tree and
returning the leaf nodes at which the query
terminates
• The main cryptographic challenge here is to
hide the traversal pattern through the tree,
which can depend upon the data and query

Review of proposals
Other custom indices
• These schemes mostly work by building
encrypted indices out of specialized data
structures for performing the specific query
computation

Review of proposals
Obliv
• Systems that implement Oblivious RAM
(ORAM) protocols aim to hide access patterns
in memory
• The main idea is to re-arrange the contents of
data in memory, for each query, in order to
obscure the relationships between data and
memory access
• The challenge is to do this efficiently
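
To make the goal concrete, here is a sketch of the simplest possible baseline, a linear-scan ("trivial") ORAM: every access touches every block, so the sequence of addresses seen by the server is the same whichever block is wanted. It does not re-arrange memory as practical ORAM constructions do; it only demonstrates the hiding property and why efficiency is the hard part. The class name and the `trace` attribute are assumptions for this example, and encryption of the blocks is omitted.

```python
class LinearScanORAM:
    """Toy ORAM: each logical access reads and rewrites every block."""

    def __init__(self, blocks):
        self._blocks = list(blocks)
        self.trace = []  # physical addresses the server would observe

    def access(self, index, new_value=None):
        """Read block `index` (and optionally overwrite it), touching all blocks."""
        result = None
        for addr in range(len(self._blocks)):
            self.trace.append(addr)
            value = self._blocks[addr]
            if addr == index:
                result = value
                if new_value is not None:
                    value = new_value
            # Every block is written back; a real system would re-encrypt it
            # so that the server cannot tell which block changed.
            self._blocks[addr] = value
        return result


oram = LinearScanORAM(["a", "b", "c", "d"])

oram.access(0)
trace_first = list(oram.trace)
oram.trace.clear()

oram.access(3)
trace_last = list(oram.trace)

# Identical traces: the access pattern reveals nothing about the index.
print(trace_first == trace_last)
```

The cost is one full pass over the data per access, which is exactly the overhead that efficient ORAM protocols try to reduce.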

Systemization

Leakage inference attacks
• Protected search systems are evaluated
against leakage
• A protected search scheme is affected by an
attack if the scheme’s leakage to the server is
at least as large as the attack’s required
minimum leakage
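
The comparison above can be sketched as a simple check. Treating the four kinds of leaked information (structure, predicates, equalities, order) as a single increasing scale is a simplifying assumption of this example, and the scheme names are hypothetical placeholders rather than real systems.

```python
from enum import IntEnum


class Leakage(IntEnum):
    """Kinds of leaked information, assumed ordered from least to most revealing."""
    STRUCTURE = 1
    PREDICATES = 2
    EQUALITIES = 3
    ORDER = 4


def attack_applies(scheme_leakage: Leakage, required_leakage: Leakage) -> bool:
    """An attack affects a scheme that leaks at least what the attack requires."""
    return scheme_leakage >= required_leakage


# Hypothetical schemes and the leakage each reveals to the server.
schemes = {
    "scheme_a": Leakage.STRUCTURE,
    "scheme_b": Leakage.EQUALITIES,
    "scheme_c": Leakage.ORDER,
}

attack_needs = Leakage.EQUALITIES
affected = sorted(name for name, leak in schemes.items() if attack_applies(leak, attack_needs))
print(affected)
```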

Leakage inference attacks
Attack requirements
• Attacker goal
– Recover a set of queries asked by the querier (query recovery)
or the data being stored at the server (data recovery)
• Required leakage
– Cardinality of a response set, the ordering of records in the
database, and identifiers of the returned records, etc.
• Attacker model
– Semi-honest, or data injection (the attacker can insert data into the database)
• Attacker prior knowledge
– Contents of full dataset (for attackers that want to recover
queries), contents of a subset of dataset, distributional
knowledge of dataset, distributional knowledge of queries,
keyword universe (knowledge of the possible values of each
field)

Leakage inference attacks
Attack efficacy
• The runtime of the attack, including time
required to create any inserted records
• The sensitivity of the recovery rate to the
amount of prior knowledge
• The keyword universe size attacked

Leakage inference attacks
Attack techniques
• Many attacks have been published, but in principle they
all rely on two facts
– Different keywords are associated with different
numbers of records
– Most systems reveal keyword identifiers for a
record either at rest or when it is returned across
multiple queries

Leakage inference attacks
Example
• Assume the attacker has full knowledge of the
database and is trying to learn the query
– With 80% of the dataset known, the attack can
yield a 40% keyword recovery rate
• The attacker sees how many records are
returned in response to a query
– If the number is unique (per query) then the
query is identified
– Also the attacker can tell that every returned
record is associated with the keyword
Leakage inference attacks
Example
• Suppose that the attacker learns that the first query was for
LastName = ‘Smith’ (a unique number of records in the response)
• Consider a second query that does not return a unique
number of records in response
– FirstName may be ‘John’ or ‘Mathew’ and both return 1,000
records
• The attacker checks how many records overlap with the
first query
– For example, there may be 100 records with ‘Mathew Smith’
and only 10 with ‘John Smith’
– By checking overlaps, the attacker can reveal the first name
• The attacker can iteratively identify queries and create
constraints for further identifying unknown queries
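
The two steps of this example (identify by result count, then disambiguate by overlap) can be sketched as below. The dataset is synthetic and chosen to mirror the numbers on the slide; the function names, the count of 150 for ‘Smith’, and the opaque identifiers are assumptions. The attacker is assumed to know the plaintext dataset but not how the server's record identifiers map to it, so only set sizes and overlap sizes are compared.

```python
import random


def identify_by_count(observed_ids: set, known_index: dict) -> list:
    """Candidates: keywords whose known result size equals the observed size."""
    return [kw for kw, ids in known_index.items() if len(ids) == len(observed_ids)]


def refine_by_overlap(observed_ids: set, candidates: list, known_index: dict, identified: dict) -> list:
    """Keep candidates whose overlap with each identified query matches what was observed."""
    return [
        kw for kw in candidates
        if all(len(known_index[kw] & known_index[k]) == len(observed_ids & obs)
               for k, obs in identified.items())
    ]


# Attacker's prior knowledge: which plaintext records match each keyword.
smith = set(range(0, 150))                            # 150 records
mathew = set(range(0, 100)) | set(range(1000, 1900))  # 1,000 records, 100 shared with Smith
john = set(range(100, 110)) | set(range(2000, 2990))  # 1,000 records, 10 shared with Smith
known_index = {"LastName=Smith": smith, "FirstName=John": john, "FirstName=Mathew": mathew}

# What the server sees: opaque identifiers of the returned records.
rng = random.Random(0)
opaque = dict(zip(range(3000), rng.sample(range(10**6), 3000)))
query1 = {opaque[i] for i in smith}
query2 = {opaque[i] for i in mathew}

# Step 1: the first query has a unique result count, so it is identified outright.
first = identify_by_count(query1, known_index)

# Step 2: the second query is ambiguous by count alone, but not by overlap.
candidates = identify_by_count(query2, known_index)
second = refine_by_overlap(query2, candidates, known_index, {"LastName=Smith": query1})

print(first, candidates, second)
```

Each identified query adds a constraint for the next one, which is why recovery rates grow as more queries are observed.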

Systemization

Leakage inference attacks
Discussion
• The provider and querier should be protected
against the server
– Privacy, the server may be compromised, etc.
• Which technique should be used?
1) How large is the keyword universe?
2) How much of the dataset or query keyword universe
(and frequency) can the attacker predict?
3) Can an attacker reasonably insert crafted records?
4) Does the adversary have persistent access to the
server, or to a snapshot at a given time?

Leakage inference attacks
Discussion
• Answers to the first three questions depend upon the
intended use case
– A system with a smaller leakage profile may be necessary
in a setting where the keyword universe is small and the
attacker has the ability to add records
– A system with a larger leakage profile may suffice in a
setting where the keyword universe is very large
• The fourth question relates to adversaries that
compromise the server
– Legacy schemes leak information about the entire database at rest
(one snapshot should be enough)
– Custom schemes leak information during query (persistent
compromise is needed)

Leakage inference attacks
Summary
• Each protected search approach has a distinct
leakage profile that results in qualitatively
different attacks
– If queries only touch a small portion of the dataset
or the adversary only has a snapshot, the impact
of leakage from Custom systems is less than from
Legacy schemes
– If queries regularly return a large fraction of the
dataset, this distinction disappears and an Obliv
scheme may be appropriate

Extending Functionality
• There are techniques for combining base
queries (equality, Boolean, etc.) into richer ones
• Schemes that support a given query type by
composing base queries tend to have more
leakage than schemes that natively support
the same query
• But, in query composition, a scheme can be
extended straightforwardly to support
multiple query types

Extending functionality

Extending database systems
Controls, rules and enforcement
• Database systems support several types of access
control mechanisms, which may constrain the
interaction of users (or programs) with data
• In addition, query control limits which queries are
acceptable
– In contrast to typical access control which mandates
which data is accessible
– For example, a query may need to specify at least five
columns in order to be sufficiently targeted
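
A query-control rule of this kind might be checked by the enforcer along the following lines. This is a sketch only: the representation of a query as a mapping from column name to requested value, the function name, and the sample columns are all assumptions for illustration.

```python
def is_sufficiently_targeted(predicates: dict, min_columns: int = 5) -> bool:
    """Accept a query only if it constrains at least `min_columns` columns."""
    specified = [col for col, value in predicates.items() if value is not None]
    return len(specified) >= min_columns


# A broad query that constrains a single column is rejected.
broad = {"last_name": "Smith"}

# A query that constrains five columns is accepted.
targeted = {
    "last_name": "Smith",
    "first_name": "John",
    "city": "Nicosia",
    "birth_year": "1980",
    "department": "Sales",
}

print(is_sufficiently_targeted(broad), is_sufficiently_targeted(targeted))
```

Unlike an access-control rule, this check depends only on the shape of the query and not on which records it would return.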

Extending database systems
Performance characterization
• Adding protections to the database system,
such as encryption, may affect the
performance
• Response times depend heavily on
– Network capacity, load and number of records
returned by the query
– Ordering of terms in subclauses within a query
– Complexity of rules based on query policy and
access control

Extending database systems
User perceptions and performance
• Users may not be ready to use protected search
systems
• In a controlled user study, it was evident that
– When response times were unpredictable,
participants were unsure whether they should wait for
a query to complete or do something else
– Participants felt the protected technologies were
slower than an unprotected system
– Participants were surprised that different types of
queries might have different performance
characteristics
Extending database systems
Current protected search databases

References
• SoK: Cryptographically Protected Database
Search, in Oakland 2017, by Benjamin Fuller,
Mayank Varia, Arkady Yerukhimovich, Emily
Shen, Ariel Hamlin, Vijay Gadepally, Richard
Shay, John Darby Mitchell, and Robert K.
Cunningham
