Thanks to visit codestin.com
Credit goes to www.scribd.com

0% found this document useful (0 votes)
75 views96 pages

MongoDB Lab Excercise

The document provides solutions to various lab exercises focused on MongoDB operations, including inserting, querying, updating, and deleting documents, as well as using projection and aggregation. It covers practical examples of using MongoDB commands, such as filtering with WHERE clauses, and applying logical operators like AND and OR. Additionally, it demonstrates the creation of collections, indexing, and executing text searches within a catalog data collection.

Uploaded by

architha1105
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as DOCX, PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
75 views96 pages

MongoDB Lab Excercise

The document provides solutions to various lab exercises focused on MongoDB operations, including inserting, querying, updating, and deleting documents, as well as using projection and aggregation. It covers practical examples of using MongoDB commands, such as filtering with WHERE clauses, and applying logical operators like AND and OR. Additionally, it demonstrates the creation of collections, indexing, and executing text searches within a catalog data collection.

Uploaded by

architha1105
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as DOCX, PDF, TXT or read online on Scribd
You are on page 1/ 96

Solutions to lab exercises

 Question 1

 Illustration of Where Clause, AND,OR operations in MongoDB.

 MongoDB : Insert, Query, Update, Delete and Projection.

 Question 2

 MongoDB query to select and ignore certain fields

 Use of limit and find in MongoDB query

 Question 3

 Execute query selectors (comparison selectors, logical selectors )

 Execute query selectors (Geospatial selectors, Bitwise selectors )

 Question 4

 Projection operators

 Question 5

 Aggregation operations

 Question 6

 Aggregation Pipeline and its operations

 Question 7

 Find all listings

 E-commerce collection reviews summary

 Question 8

 Demonstrate different types of indexes

 Demonstrate optimization of queries using indexes.

 Question 9
 Demonstrate Text search using catalog data collection for a given
word

 Excluding documents with certain words and phrases

 Question 10

 Aggregation pipeline to illustrate Text search on Catalog data


collection.

PART A

Question 1

MongoDB Operations

b. Execute the Commands of MongoDB and operations in MongoDB : Insert,


Query, Update, Delete and Projection. (Note: use any collection)

(part a is answered after part b)

Switch to a Database (Optional):

If you want to use a specific database, switch to that database using


the use command. If the database doesn’t exist, MongoDB will create it implicitly
when you insert data into it:

test> use ProgBooksDB

switched to db ProgBooksDB

ProgBooksDB>

Create the ProgrammingBooks Collection:

To create the ProgrammingBooks collection, use the createCollection() method.


This step is optional because MongoDB will automatically create the collection
when you insert data into it, but you can explicitly create it if needed:
ProgBooksDB> db.createCollection("ProgrammingBooks")

Insert operations

Insert a Single Document into ProgrammingBooks:

Use the insertOne() method to insert a new document into


the ProgrammingBooks collection:

ProgBooksDB> db.ProgrammingBooks.insertOne({

title: "The Pragmatic Programmer: Your Journey to Mastery",

author: "David Thomas, Andrew Hunt",

category: "Software Development",

year: 1999

})

Insert multiple Documents into the ProgrammingBooks Collection :

Now, insert 5 documents representing programming books into


the ProgrammingBooks collection using the insertMany() method:

ProgBooksDB> db.ProgrammingBooks.insertMany([

title: "Clean Code: A Handbook of Agile Software Craftsmanship",

author: "Robert C. Martin",

category: "Software Development",

year: 2008

},

title: "JavaScript: The Good Parts",


author: "Douglas Crockford",

category: "JavaScript",

year: 2008

},

title: "Design Patterns: Elements of Reusable Object-Oriented Software",

author: "Erich Gamma, Richard Helm, Ralph Johnson, John Vlissides",

category: "Software Design",

year: 1994

},

title: "Introduction to Algorithms",

author: "Thomas H. Cormen, Charles E. Leiserson, Ronald L. Rivest, Clifford


Stein",

category: "Algorithms",

year: 1990

},

title: "Python Crash Course: A Hands-On, Project-Based Introduction to


Programming",

author: "Eric Matthes",

category: "Python",

year: 2015
}

])

Query operations

Find All Documents

To retrieve all documents from the ProgrammingBooks collection:

ProgBooksDB> db.ProgrammingBooks.find()

_id: ObjectId('664ee3b1924a8039f62202d8'),

title: 'The Pragmatic Programmer: Your Journey to Mastery',

author: 'David Thomas, Andrew Hunt',

category: 'Software Development',

year: 1999

},

_id: ObjectId('664ee452924a8039f62202d9'),

title: 'Clean Code: A Handbook of Agile Software Craftsmanship',

author: 'Robert C. Martin',

category: 'Software Development',

year: 2008

},

_id: ObjectId('664ee452924a8039f62202da'),
title: 'JavaScript: The Good Parts',

author: 'Douglas Crockford',

category: 'JavaScript',

year: 2008

},

_id: ObjectId('664ee452924a8039f62202db'),

title: 'Design Patterns: Elements of Reusable Object-Oriented Software',

author: 'Erich Gamma, Richard Helm, Ralph Johnson, John Vlissides',

category: 'Software Design',

year: 1994

},

_id: ObjectId('664ee452924a8039f62202dc'),

title: 'Introduction to Algorithms',

author: 'Thomas H. Cormen, Charles E. Leiserson, Ronald L. Rivest, Clifford


Stein',

category: 'Algorithms',

year: 1990

},

_id: ObjectId('664ee452924a8039f62202dd'),
title: 'Python Crash Course: A Hands-On, Project-Based Introduction to
Programming',

author: 'Eric Matthes',

category: 'Python',

year: 2015

Find Documents Matching a Condition

To find books published after the year 2000:

ProgBooksDB> db.ProgrammingBooks.find({ year: { $gt: 2000 } })

_id: ObjectId('664ee452924a8039f62202d9'),

title: 'Clean Code: A Handbook of Agile Software Craftsmanship',

author: 'Robert C. Martin',

category: 'Software Development',

year: 2008

},

_id: ObjectId('664ee452924a8039f62202da'),

title: 'JavaScript: The Good Parts',

author: 'Douglas Crockford',

category: 'JavaScript',
year: 2008

},

_id: ObjectId('664ee452924a8039f62202dd'),

title: 'Python Crash Course: A Hands-On, Project-Based Introduction to


Programming',

author: 'Eric Matthes',

category: 'Python',

year: 2015

Update Operations

a. Update a Single Document

To update a specific book (e.g., change the author of a book):

ProgBooksDB>db.ProgrammingBooks.updateOne(

{ title: "Clean Code: A Handbook of Agile Software Craftsmanship" },

{ $set: { author: "Robert C. Martin (Uncle Bob)" } }

//verify by displaying books published in year 2008

ProgBooksDB> db.ProgrammingBooks.find({ year: { $eq: 2008 } }).pretty()

{
_id: ObjectId('663eaaebae582498972202df'),

title: 'Clean Code: A Handbook of Agile Software Craftsmanship',

author: 'Robert C. Martin (Uncle Bob)',

category: 'Software Development',

year: 2008

},

_id: ObjectId('663eaaebae582498972202e0'),

title: 'JavaScript: The Good Parts',

author: 'Douglas Crockford',

category: 'JavaScript',

year: 2008

//another way to verify

ProgBooksDB> db.ProgrammingBooks.find({ author: { $regex:


"Robert*" } }).pretty()

_id: ObjectId('664ee452924a8039f62202d9'),

title: 'Clean Code: A Handbook of Agile Software Craftsmanship',

author: 'Robert C. Martin (Uncle Bob)',


category: 'Software Development',

year: 2008

b. Update Multiple Documents

To update multiple books (e.g., update the category of books published before
2010):

ProgBooksDB> db.ProgrammingBooks.updateMany(

{ year: { $lt: 2010 } },

{ $set: { category: "Classic Programming Books" } }

//verify the update operation by displaying books published before year 2010

ProgBooksDB> db.ProgrammingBooks.find({ year: { $lt: 2010 } }).pretty()

_id: ObjectId('663eaaebae582498972202df'),

title: 'Clean Code: A Handbook of Agile Software Craftsmanship',

author: 'Robert C. Martin (Uncle Bob)',

category: 'Classic Programming Books',

year: 2008

},

{
_id: ObjectId('663eaaebae582498972202e0'),

title: 'JavaScript: The Good Parts',

author: 'Douglas Crockford',

category: 'Classic Programming Books',

year: 2008

},

_id: ObjectId('663eaaebae582498972202e1'),

title: 'Design Patterns: Elements of Reusable Object-Oriented Software',

author: 'Erich Gamma, Richard Helm, Ralph Johnson, John Vlissides',

category: 'Classic Programming Books',

year: 1994

},

_id: ObjectId('663eaaebae582498972202e2'),

title: 'Introduction to Algorithms',

author: 'Thomas H. Cormen, Charles E. Leiserson, Ronald L. Rivest, Clifford


Stein',

category: 'Classic Programming Books',

year: 1990

},

_id: ObjectId('663eab05ae582498972202e4'),
title: 'The Pragmatic Programmer: Your Journey to Mastery',

author: 'David Thomas, Andrew Hunt',

category: 'Classic Programming Books',

year: 1999

Delete Operations

Delete a Single Document

To delete a specific book from the collection (e.g., delete a book by title):

ProgBooksDB> db.ProgrammingBooks.deleteOne({ title: "JavaScript: The Good


Parts" })

{ acknowledged: true, deletedCount: 1 }

//Verify to see document is deleted

ProgBooksDB> db.ProgrammingBooks.find({ title: "JavaScript: The Good


Parts" }).pretty()

Delete Multiple Documents

To delete multiple books based on a condition (e.g., delete all books published
before 1995):

ProgBooksDB> db.ProgrammingBooks.deleteMany({ year: { $lt: 1995 } })

{ acknowledged: true, deletedCount: 2 }

You can check whether the specified documents were deleted by displaying the
contents of the collection.
Delete All Documents in the Collection:

To delete all documents in a collection (e.g., ProgrammingBooks), use


the deleteMany() method with an empty filter {}:

//delete all documents in a collection

ProgBooksDB> db.ProgrammingBooks.deleteMany({})

{ acknowledged: true, deletedCount: 3 }

//verify by displaying the collection

ProgBooksDB> db.ProgrammingBooks.find().pretty()

Projection Operations

In MongoDB, a projection refers to the mechanism of specifying which fields (or


columns) should be returned from a query result. When querying a collection, you
can use projection to control the shape of the returned documents by specifying
which fields to include or exclude.

In MongoDB, projection is typically specified as the second parameter to


the find() method. The projection parameter takes an object where keys represent
the fields to include or exclude, with values of 1 (include) or 0 (exclude).

Include Specific Fields:

Use 1 to include a field in the result:

ProgBooksDB> db.ProgrammingBooks.find({}, { title: 1, author: 1 } )

_id: ObjectId('665163289edbdf91e12202dd'),

title: 'Clean Code: A Handbook of Agile Software Craftsmanship',

author: 'Robert C. Martin'


},

_id: ObjectId('665163289edbdf91e12202e1'),

title: 'Python Crash Course: A Hands-On, Project-Based Introduction to


Programming',

author: 'Eric Matthes'

Exclude Specific Fields:

Use 0 to exclude a field from the result:

ProgBooksDB> db.ProgrammingBooks.find({}, {year: 0})

_id: ObjectId('665163289edbdf91e12202dd'),

title: 'Clean Code: A Handbook of Agile Software Craftsmanship',

author: 'Robert C. Martin',

category: 'Software Development'

},

_id: ObjectId('665163289edbdf91e12202e1'),

title: 'Python Crash Course: A Hands-On, Project-Based Introduction to


Programming',

author: 'Eric Matthes',


category: 'Python'

Where Clause, AND,OR operations in MongoDB.

a. Illustration of Where Clause, AND,OR operations in MongoDB.

In MongoDB, the equivalent of SQL’s WHERE clause is achieved using query


filters within the find() method. You can also combine multiple conditions using
logical operators like $and and $or. Here’s how you can illustrate the usage of
these features:

Setting Up Example Data

First, let’s assume we have a collection named ProgrammingBooks with the


following documents:

ProgBooksDB> use newDB

switched to db newDB

newDB> db.createCollection("ProgrammingBooks")

{ ok: 1 }

newDB> db.ProgrammingBooks.insertMany([

{ title: "Clean Code", author: "Robert C. Martin", category: "Software


Development", year: 2008 },

{ title: "JavaScript: The Good Parts", author: "Douglas Crockford", category:


"JavaScript", year: 2008 },

{ title: "Design Patterns", author: "Erich Gamma", category: "Software Design",


year: 1994 },

{ title: "Introduction to Algorithms", author: "Thomas H. Cormen", category:


"Algorithms", year: 2009 },
{ title: "Python Crash Course", author: "Eric Matthes", category: "Python", year:
2015 }

]);

acknowledged: true,

insertedIds: {

'0': ObjectId('6651daad9edbdf91e12202e2'),

'1': ObjectId('6651daad9edbdf91e12202e3'),

'2': ObjectId('6651daad9edbdf91e12202e4'),

'3': ObjectId('6651daad9edbdf91e12202e5'),

'4': ObjectId('6651daad9edbdf91e12202e6')

Using the WHERE Clause Equivalent

To query documents with specific conditions, you can use the find() method with a
filter object. For example, to find books published in the year 2008:

newDB> db.ProgrammingBooks.find({ year: 2008 })

_id: ObjectId('6651daad9edbdf91e12202e2'),

title: 'Clean Code',

author: 'Robert C. Martin',


category: 'Software Development',

year: 2008

},

_id: ObjectId('6651daad9edbdf91e12202e3'),

title: 'JavaScript: The Good Parts',

author: 'Douglas Crockford',

category: 'JavaScript',

year: 2008

Using the $and Operator

The $and operator is used to combine multiple conditions that must all be true.
Here’s how to find books that are in the “Software Development” category and
published in the year 2008:

newDB>db.ProgrammingBooks.find({

$and: [

{ category: "Software Development" },

{ year: 2008 }

}).pretty()

[
{

_id: ObjectId('6651daad9edbdf91e12202e2'),

title: 'Clean Code',

author: 'Robert C. Martin',

category: 'Software Development',

year: 2008

In this query:

 Both conditions must be met for a document to be included in the result.

Using the $or Operator

The $or operator is used to combine multiple conditions where at least one must be
true. Here’s how to find books that are either in the “JavaScript” category or
published in the year 2015:

newDB> db.ProgrammingBooks.find({

$or: [

{ category: "JavaScript" },

{ year: 2015 }

}).pretty()

{
_id: ObjectId('6651daad9edbdf91e12202e3'),

title: 'JavaScript: The Good Parts',

author: 'Douglas Crockford',

category: 'JavaScript',

year: 2008

},

_id: ObjectId('6651daad9edbdf91e12202e6'),

title: 'Python Crash Course',

author: 'Eric Matthes',

category: 'Python',

year: 2015

In this query:

 A document will be included in the result if it meets either condition.

Combining $and and $or Operators

You can combine $and and $or operators for more complex queries. For example,
to find books that are either in the “Software Development” category and
published after 2007, or in the “Python” category:

newDB> db.ProgrammingBooks.find({

$or: [

{
$and: [

{ category: "Software Development" },

{ year: { $gt: 2007 } }

},

{ category: "Python" }

}).pretty()

_id: ObjectId('6651daad9edbdf91e12202e2'),

title: 'Clean Code',

author: 'Robert C. Martin',

category: 'Software Development',

year: 2008

},

_id: ObjectId('6651daad9edbdf91e12202e6'),

title: 'Python Crash Course',

author: 'Eric Matthes',

category: 'Python',

year: 2015
}

In this query:

 The document will be included if it meets the combined $and conditions of


being in the “Software Development” category and published after 2007, or
if it is in the “Python” category.

Question 2

a. Select and ignore fields

Develop a MongoDB query to select certain fields and ignore some fields of
the documents from any collection.

To select certain fields and ignore others in MongoDB, you use projections in your
queries. Projections allow you to specify which fields to include or exclude in the
returned documents.

Create database and create the Collection:

test> use MoviesDB

switched to db MoviesDB

MoviesDB> db.createCollection("Movies")

{ ok: 1 }

MoviesDB> db.Movies.insertMany([

{ title: "Inception", director: "Christopher Nolan", genre: "Science Fiction", year:


2010, ratings: { imdb: 8.8, rottenTomatoes: 87 } },

{ title: "The Matrix", director: "Wachowskis", genre: "Science Fiction", year:


1999, ratings: { imdb: 8.7, rottenTomatoes: 87 } },

{ title: "The Godfather", director: "Francis Ford Coppola", genre: "Crime", year:
1972, ratings: { imdb: 9.2, rottenTomatoes: 97 } }
]);

acknowledged: true,

insertedIds: {

'0': ObjectId('66523751d5449c3abf2202d8'),

'1': ObjectId('66523751d5449c3abf2202d9'),

'2': ObjectId('66523751d5449c3abf2202da')

Basic Syntax for Projection

When using the find() method, the first parameter is the query filter, and the
second parameter is the projection object. The projection object specifies the fields
to include (using 1) or exclude (using 0).

Including Specific Fields

To include specific fields, set the fields you want to include to 1:

To select only the title and director fields from the Movies collection:

MoviesDB> db.Movies.find({}, { title: 1, director: 1 })

_id: ObjectId('66523751d5449c3abf2202d8'),

title: 'Inception',

director: 'Christopher Nolan'


},

_id: ObjectId('66523751d5449c3abf2202d9'),

title: 'The Matrix',

director: 'Wachowskis'

},

_id: ObjectId('66523751d5449c3abf2202da'),

title: 'The Godfather',

director: 'Francis Ford Coppola'

MoviesDB> db.Movies.find({}, { title: 1, director: 1, _id: 0 })

{ title: 'Inception', director: 'Christopher Nolan' },

{ title: 'The Matrix', director: 'Wachowskis' },

{ title: 'The Godfather', director: 'Francis Ford Coppola' }

In this query:

 The filter {} means we want to select all documents.

 The projection { title: 1, director: 1, _id: 0 } means we include


the title and director fields, and exclude the _id field (which is included by
default unless explicitly excluded).
Excluding Specific Fields

To exclude specific fields, set the fields you want to exclude to 0:

To exclude the ratings field from the results:

MoviesDB> db.Movies.find({}, { ratings: 0 })

_id: ObjectId('66523751d5449c3abf2202d8'),

title: 'Inception',

director: 'Christopher Nolan',

genre: 'Science Fiction',

year: 2010

},

_id: ObjectId('66523751d5449c3abf2202d9'),

title: 'The Matrix',

director: 'Wachowskis',

genre: 'Science Fiction',

year: 1999

},

_id: ObjectId('66523751d5449c3abf2202da'),

title: 'The Godfather',

director: 'Francis Ford Coppola',


genre: 'Crime',

year: 1972

In this query:

 The filter {} means we want to select all documents.

 The projection { ratings: 0 } means we exclude the ratings field.

Combining Filter and Projection

You can also combine a query filter with a projection. For example, to find movies
directed by “Christopher Nolan” and include only the title and year fields:

MoviesDB> db.Movies.find({ director: "Christopher Nolan" }, { title: 1, year: 1,


_id: 0 })

[ { title: 'Inception', year: 2010 } ]

In this query:

 The filter { director: "Christopher Nolan" } selects documents where


the director is “Christopher Nolan”.

 The projection { title: 1, year: 1, _id: 0 } includes only


the title and year fields and excludes the _id field.

In MongoDB, projections are used to control which fields are included or excluded
in the returned documents. This is useful for optimizing queries and reducing the
amount of data transferred over the network. You specify projections as the second
parameter in the find() method.

b. Use of limit and find in MongoDB query

Develop a MongoDB query to display the first 5 documents from the results
obtained in a. (illustrate use of limit and find)
To display the first 5 documents from a query result in MongoDB, you can use
the limit() method in conjunction with the find() method. The limit() method
restricts the number of documents returned by the query to the specified number.

Example Scenario

Assume we have the Movies collection as described previously:

test> use MoviesDB

switched to db MoviesDB

MoviesDB> db.createCollection("Movies")

{ ok: 1 }

MoviesDB>db.Movies.insertMany([

{ title: "Inception", director: "Christopher Nolan", genre: "Science Fiction", year:


2010, ratings: { imdb: 8.8, rottenTomatoes: 87 } },

{ title: "The Matrix", director: "Wachowskis", genre: "Science Fiction", year:


1999, ratings: { imdb: 8.7, rottenTomatoes: 87 } },

{ title: "The Godfather", director: "Francis Ford Coppola", genre: "Crime", year:
1972, ratings: { imdb: 9.2, rottenTomatoes: 97 } },

{ title: "Pulp Fiction", director: "Quentin Tarantino", genre: "Crime", year: 1994,
ratings: { imdb: 8.9, rottenTomatoes: 92 } },

{ title: "The Shawshank Redemption", director: "Frank Darabont", genre:


"Drama", year: 1994, ratings: { imdb: 9.3, rottenTomatoes: 91 } },

{ title: "The Dark Knight", director: "Christopher Nolan", genre: "Action", year:
2008, ratings: { imdb: 9.0, rottenTomatoes: 94 } },

{ title: "Fight Club", director: "David Fincher", genre: "Drama", year: 1999,
ratings: { imdb: 8.8, rottenTomatoes: 79 } }

]);
Query with Projection and Limit

Suppose you want to display the first 5 documents from the Movies collection,
including only the title, director, and year fields. Here’s how you can do it:

MoviesDB> db.Movies.find({}, { title: 1, director: 1, year: 1, _id: 0 }).limit(5)

{ "title": "Inception", "director": "Christopher Nolan", "year": 2010 },

{ "title": "The Matrix", "director": "Wachowskis", "year": 1999 },

{ "title": "The Godfather", "director": "Francis Ford Coppola", "year": 1972 },

{ "title": "Pulp Fiction", "director": "Quentin Tarantino", "year": 1994 },

{ "title": "The Shawshank Redemption", "director": "Frank Darabont", "year":


1994 }

Explanation:

 find({}): This filter {} selects all documents in the collection.

 { title: 1, director: 1, year: 1, _id: 0 }: This projection includes


the title, director, and year fields, and excludes the _id field.

 .limit(5): This method limits the query result to the first 5 documents.

By using the find() method with a projection and the limit() method, you can
efficiently query and display a subset of documents from a MongoDB collection.
This approach helps manage large datasets by retrieving only a specific number of
documents, which is particularly useful for paginating results in applications.

Question 3

a. Query selectors (comparison selectors, logical selectors )


Execute query selectors (comparison selectors, logical selectors ) and list out
the results on any collection

Let’s create a new collection called Employees and insert some documents into it.
Then, we’ll demonstrate the use of comparison selectors and logical selectors to
query this collection.

Create the Employees Collection and Insert Documents

First, we need to create the Employees collection and insert some sample
documents.

test> use companyDB

companyDB> db.Employees.insertMany([

{ name: "Alice", age: 30, department: "HR", salary: 50000, joinDate: new
Date("2015-01-15") },

{ name: "Bob", age: 24, department: "Engineering", salary: 70000, joinDate: new
Date("2019-03-10") },

{ name: "Charlie", age: 29, department: "Engineering", salary: 75000, joinDate:


new Date("2017-06-23") },

{ name: "David", age: 35, department: "Marketing", salary: 60000, joinDate: new
Date("2014-11-01") },

{ name: "Eve", age: 28, department: "Finance", salary: 80000, joinDate: new
Date("2018-08-19") }

])

acknowledged: true,

insertedIds: {
'0': ObjectId('665356cff5b334bcf92202d8'),

'1': ObjectId('665356cff5b334bcf92202d9'),

'2': ObjectId('665356cff5b334bcf92202da'),

'3': ObjectId('665356cff5b334bcf92202db'),

'4': ObjectId('665356cff5b334bcf92202dc')

Queries Using Comparison Selectors

1. $eq (Equal)

Find employees in the “Engineering” department.

companyDB> db.Employees.find({ department: { $eq: "Engineering" } }).pretty()

_id: ObjectId('665356cff5b334bcf92202d9'),

name: 'Bob',

age: 24,

department: 'Engineering',

salary: 70000,

joinDate: ISODate('2019-03-10T00:00:00.000Z')

},

_id: ObjectId('665356cff5b334bcf92202da'),

name: 'Charlie',
age: 29,

department: 'Engineering',

salary: 75000,

joinDate: ISODate('2017-06-23T00:00:00.000Z')

2. $ne (Not Equal)

Find employees who are not in the “HR” department.

companyDB> db.Employees.find({ department: { $ne: "HR" } }).pretty()

_id: ObjectId('665356cff5b334bcf92202d9'),

name: 'Bob',

age: 24,

department: 'Engineering',

salary: 70000,

joinDate: ISODate('2019-03-10T00:00:00.000Z')

},

_id: ObjectId('665356cff5b334bcf92202da'),

name: 'Charlie',

age: 29,

department: 'Engineering',
salary: 75000,

joinDate: ISODate('2017-06-23T00:00:00.000Z')

},

_id: ObjectId('665356cff5b334bcf92202db'),

name: 'David',

age: 35,

department: 'Marketing',

salary: 60000,

joinDate: ISODate('2014-11-01T00:00:00.000Z')

},

_id: ObjectId('665356cff5b334bcf92202dc'),

name: 'Eve',

age: 28,

department: 'Finance',

salary: 80000,

joinDate: ISODate('2018-08-19T00:00:00.000Z')

3. $gt (Greater Than)

Find employees who are older than 30.

companyDB> db.Employees.find({ age: { $gt: 30 } }).pretty()


[

_id: ObjectId('665356cff5b334bcf92202db'),

name: 'David',

age: 35,

department: 'Marketing',

salary: 60000,

joinDate: ISODate('2014-11-01T00:00:00.000Z')

4. $lt (Less Than)

Find employees with a salary less than 70000.

companyDB> db.Employees.find({ salary: { $lt: 70000 } }).pretty()

_id: ObjectId('665356cff5b334bcf92202d8'),

name: 'Alice',

age: 30,

department: 'HR',

salary: 50000,

joinDate: ISODate('2015-01-15T00:00:00.000Z')

},

{
_id: ObjectId('665356cff5b334bcf92202db'),

name: 'David',

age: 35,

department: 'Marketing',

salary: 60000,

joinDate: ISODate('2014-11-01T00:00:00.000Z')

5. $gte (Greater Than or Equal)

Find employees who joined on or after January 1, 2018.

companyDB> db.Employees.find({ joinDate: { $gte: new Date("2018-01-


01") } }).pretty()

_id: ObjectId('665356cff5b334bcf92202d9'),

name: 'Bob',

age: 24,

department: 'Engineering',

salary: 70000,

joinDate: ISODate('2019-03-10T00:00:00.000Z')

},

_id: ObjectId('665356cff5b334bcf92202dc'),
name: 'Eve',

age: 28,

department: 'Finance',

salary: 80000,

joinDate: ISODate('2018-08-19T00:00:00.000Z')

6. $lte (Less Than or Equal)

Find employees who are 28 years old or younger.

companyDB> db.Employees.find({ age: { $lte: 28 } }).pretty()

_id: ObjectId('665356cff5b334bcf92202d9'),

name: 'Bob',

age: 24,

department: 'Engineering',

salary: 70000,

joinDate: ISODate('2019-03-10T00:00:00.000Z')

},

_id: ObjectId('665356cff5b334bcf92202dc'),

name: 'Eve',

age: 28,
department: 'Finance',

salary: 80000,

joinDate: ISODate('2018-08-19T00:00:00.000Z')

Queries Using Logical Selectors

1. $and (Logical AND)

Find employees who are in the “Engineering” department and have a salary greater
than 70000.

companyDB> db.Employees.find({

$and: [

{ department: "Engineering" },

{ salary: { $gt: 70000 } }

}).pretty()

_id: ObjectId('665356cff5b334bcf92202da'),

name: 'Charlie',

age: 29,

department: 'Engineering',

salary: 75000,
joinDate: ISODate('2017-06-23T00:00:00.000Z')

2. $or (Logical OR)

Find employees who are either in the “HR” department or have a salary less than
60000.

companyDB> db.Employees.find({

$or: [

{ department: "HR" },

{ salary: { $lt: 60000 } }

}).pretty()

_id: ObjectId('665356cff5b334bcf92202d8'),

name: 'Alice',

age: 30,

department: 'HR',

salary: 50000,

joinDate: ISODate('2015-01-15T00:00:00.000Z')

]
3. $not (Logical NOT)

Find employees who are not in the “Engineering” department.

companyDB> db.Employees.find({

department: {

$not: { $eq: "Engineering" }

}).pretty()

_id: ObjectId('665356cff5b334bcf92202d8'),

name: 'Alice',

age: 30,

department: 'HR',

salary: 50000,

joinDate: ISODate('2015-01-15T00:00:00.000Z')

},

_id: ObjectId('665356cff5b334bcf92202db'),

name: 'David',

age: 35,

department: 'Marketing',

salary: 60000,
joinDate: ISODate('2014-11-01T00:00:00.000Z')

},

_id: ObjectId('665356cff5b334bcf92202dc'),

name: 'Eve',

age: 28,

department: 'Finance',

salary: 80000,

joinDate: ISODate('2018-08-19T00:00:00.000Z')

4. $nor (Logical NOR)

Find employees who are neither in the “HR” department nor have a salary greater
than 75000.

companyDB> db.Employees.find({

$nor: [

{ department: "HR" },

{ salary: { $gt: 75000 } }

}).pretty()

{
_id: ObjectId('665356cff5b334bcf92202d9'),

name: 'Bob',

age: 24,

department: 'Engineering',

salary: 70000,

joinDate: ISODate('2019-03-10T00:00:00.000Z')

},

_id: ObjectId('665356cff5b334bcf92202da'),

name: 'Charlie',

age: 29,

department: 'Engineering',

salary: 75000,

joinDate: ISODate('2017-06-23T00:00:00.000Z')

},

_id: ObjectId('665356cff5b334bcf92202db'),

name: 'David',

age: 35,

department: 'Marketing',

salary: 60000,

joinDate: ISODate('2014-11-01T00:00:00.000Z')

}
]

b. Query selectors (Geospatial selectors, Bitwise selectors )

Execute query selectors (Geospatial selectors, Bitwise selectors ) and list out
the results on any collection

Let’s extend our MongoDB examples to include queries using geospatial selectors
and bitwise selectors. We will create a new collection called Places for geospatial
queries and a collection called Devices for bitwise queries.

Geospatial Selectors

First, let’s create a Places collection with geospatial data.

Create the Places Collection and Insert Documents

test> use geoDatabase

switched to db geoDatabase

geoDatabase> db.Places.insertMany([

{ name: "Central Park", location: { type: "Point", coordinates: [-73.9654,


40.7829] } },

{ name: "Times Square", location: { type: "Point", coordinates: [-73.9851,


40.7580] } },

{ name: "Brooklyn Bridge", location: { type: "Point", coordinates: [-73.9969,


40.7061] } },

{ name: "Empire State Building", location: { type: "Point", coordinates: [-


73.9857, 40.7488] } },

{ name: "Statue of Liberty", location: { type: "Point", coordinates: [-74.0445,


40.6892] } }

])
{

acknowledged: true,

insertedIds: {

'0': ObjectId('66536a9799cad9cd2b2202d8'),

'1': ObjectId('66536a9799cad9cd2b2202d9'),

'2': ObjectId('66536a9799cad9cd2b2202da'),

'3': ObjectId('66536a9799cad9cd2b2202db'),

'4': ObjectId('66536a9799cad9cd2b2202dc')

// Create a geospatial index

geoDatabase> db.Places.createIndex({ location: "2dsphere" })

location_2dsphere

Geospatial Queries

1. $near (Find places near a certain point)

Find places near a specific coordinate, for example, near Times Square.

geoDatabase> db.Places.find({

location: {

$near: {

$geometry: {

type: "Point",

coordinates: [-73.9851, 40.7580]


},

$maxDistance: 5000 // distance in meters

}).pretty()

_id: ObjectId('66536a9799cad9cd2b2202d9'),

name: 'Times Square',

location: { type: 'Point', coordinates: [ -73.9851, 40.758 ] }

},

_id: ObjectId('66536a9799cad9cd2b2202db'),

name: 'Empire State Building',

location: { type: 'Point', coordinates: [ -73.9857, 40.7488 ] }

},

_id: ObjectId('66536a9799cad9cd2b2202d8'),

name: 'Central Park',

location: { type: 'Point', coordinates: [ -73.9654, 40.7829 ] }

]
2. $geoWithin (Find places within a specific area)

Find places within a specific polygon, for example, an area covering part of
Manhattan.

geoDatabase> db.Places.find({

location: {

$geoWithin: {

$geometry: {

type: "Polygon",

coordinates: [

[-70.016, 35.715],

[-74.014, 40.717],

[-73.990, 40.730],

[-73.990, 40.715],

[-70.016, 35.715]

}).pretty()

[
{

_id: ObjectId('66536a9799cad9cd2b2202da'),

name: 'Brooklyn Bridge',

location: { type: 'Point', coordinates: [ -73.9969, 40.7061 ] }

Bitwise Selectors

Next, let’s create a Devices collection for bitwise operations.

Create the Devices Collection and Insert Documents

test> use techDB

techDB> db.Devices.insertMany([

{ name: "Device A", status: 5 }, // Binary: 0101

{ name: "Device B", status: 3 }, // Binary: 0011

{ name: "Device C", status: 12 }, // Binary: 1100

{ name: "Device D", status: 10 }, // Binary: 1010

{ name: "Device E", status: 7 } // Binary: 0111

])

Execute Bitwise Queries

1. $bitsAllSet (Find documents where all bits are set)

Find devices where the binary status has both the 1st and 3rd bits set (binary mask
0101, or decimal 5).

techDB> db.Devices.find({
status: { $bitsAllSet: [0, 2] }

}).pretty()

_id: ObjectId('6653703d4e38f292e52202d8'),

name: 'Device A',

status: 5

},

_id: ObjectId('6653703d4e38f292e52202dc'),

name: 'Device E',

status: 7

2. $bitsAnySet (Find documents where any of the bits are set)

Find devices where the binary status has at least the 2nd bit set (binary mask 0010,
or decimal 2).

techDB> db.Devices.find({

status: { $bitsAnySet: [1] }

}).pretty()

[
{

_id: ObjectId('6653703d4e38f292e52202d9'),

name: 'Device B',

status: 3

},

_id: ObjectId('6653703d4e38f292e52202db'),

name: 'Device D',

status: 10

},

_id: ObjectId('6653703d4e38f292e52202dc'),

name: 'Device E',

status: 7

3. $bitsAllClear (Find documents where all bits are clear)

Find devices where the binary status has both the 2nd and 4th bits clear (binary
mask 1010, or decimal 10).

techDB> db.Devices.find({

status: { $bitsAllClear: [1, 3] }

}).pretty()
[

_id: ObjectId('6653703d4e38f292e52202d8'),

name: 'Device A',

status: 5

4. $bitsAnyClear (Find documents where any of the bits are clear)

Find devices where the binary status has at least the 1st bit clear (binary mask
0001, or decimal 1).

techDB> db.Devices.find({

status: { $bitsAnyClear: [0] }

}).pretty()

_id: ObjectId('6653703d4e38f292e52202da'),

name: 'Device C',

status: 12

},

_id: ObjectId('6653703d4e38f292e52202db'),

name: 'Device D',


status: 10

Question 4

Projection Operators

Create and demonstrate how projection operators (,elematch and $slice)


would be used in the MondoDB.

To demonstrate the use of projection operators ($, $elemMatch, and $slice) in


MongoDB, let’s create a Products collection. We’ll insert documents that include
arrays, which will allow us to showcase these operators effectively.

Create the Products Collection and Insert Documents

test> use retailDB

switched to db retailDB

retailDB> db.Products.insertMany([

name: "Laptop",

brand: "BrandA",

features: [

{ name: "Processor", value: "Intel i7" },

{ name: "RAM", value: "16GB" },

{ name: "Storage", value: "512GB SSD" }

],
reviews: [

{ user: "Alice", rating: 5, comment: "Excellent!" },

{ user: "Bob", rating: 4, comment: "Very good" },

{ user: "Charlie", rating: 3, comment: "Average" }

},

name: "Smartphone",

brand: "BrandB",

features: [

{ name: "Processor", value: "Snapdragon 888" },

{ name: "RAM", value: "8GB" },

{ name: "Storage", value: "256GB" }

],

reviews: [

{ user: "Dave", rating: 4, comment: "Good phone" },

{ user: "Eve", rating: 2, comment: "Not satisfied" }

])

Use Projection Operators

1. The $ Projection Operator


The $ operator is used to project the first matching element from an array of
embedded documents.

Example: Find the product named “Laptop” and project the review from the user
“Alice”.

retailDB> db.Products.find(

{ name: "Laptop", "reviews.user": "Alice" },

{ "reviews.$": 1 }

).pretty()

Result:

"_id": ObjectId("..."),

"reviews": [

{ "user": "Alice", "rating": 5, "comment": "Excellent!" }

2. The $elemMatch Projection Operator

The $elemMatch operator is used to project the first matching element from an
array based on specified criteria.

Example: Find the product named “Laptop” and project the review where the
rating is greater than 4.

retailDB> db.Products.find(

{ name: "Laptop" },

{ reviews: { $elemMatch: { rating: { $gt: 4 } } } }

).pretty()
Result:

"_id": ObjectId("..."),

"reviews": [

{ "user": "Alice", "rating": 5, "comment": "Excellent!" }

3. The $slice Projection Operator

The $slice operator is used to include a subset of the array field.

Example: Find the product named “Smartphone” and project the first review.

retailDB> db.Products.find(

{ name: "Smartphone" },

{ reviews: { $slice: 1 } }

).pretty()

Result:

"_id": ObjectId("..."),

"reviews": [

{ "user": "Dave", "rating": 4, "comment": "Good phone" }

Additional Example with Multiple Projection Operators


Example: Find the product named “Laptop” and project the name, the first two
features, and the review with the highest rating.

retailDB> db.Products.find(

{ name: "Laptop" },

name: 1,

features: { $slice: 2 },

reviews: { $elemMatch: { rating: 5 } }

).pretty()

Result:

"_id": ObjectId("..."),

"name": "Laptop",

"features": [

{ "name": "Processor", "value": "Intel i7" },

{ "name": "RAM", "value": "16GB" }

],

"reviews": [

{ "user": "Alice", "rating": 5, "comment": "Excellent!" }

}
Using projection operators in MongoDB, you can fine-tune the data returned by
your queries:

 The $ operator is useful for projecting the first matching element from an
array.

 The $elemMatch operator allows you to project the first array element that
matches specified criteria.

 The $slice operator lets you project a subset of an array, such as the
first n elements or a specific range.

Question 5

Aggregation operations

Execute Aggregation operations (avg,min,max,push, $addToSet etc.). students


encourage to execute several queries to demonstrate various aggregation operators)

To demonstrate aggregation operations such as $avg, $min, $max, $push,


and $addToSet in MongoDB, we will use a Sales collection. This collection will
contain documents representing sales transactions.

Create the Sales Collection and Insert Documents

First, we’ll create the Sales collection and insert sample documents.

test> use salesDB

salesDB> db.Sales.insertMany([

{ date: new Date("2024-01-01"), product: "Laptop", price: 1200, quantity: 1,


customer: "Amar" },

{ date: new Date("2024-01-02"), product: "Laptop", price: 1200, quantity: 2,


customer: "Babu" },
{ date: new Date("2024-01-03"), product: "Mouse", price: 25, quantity: 5,
customer: "Chandra" },

{ date: new Date("2024-01-04"), product: "Keyboard", price: 45, quantity: 3,


customer: "Amar" },

{ date: new Date("2024-01-05"), product: "Monitor", price: 300, quantity: 1,


customer: "Babu" },

{ date: new Date("2024-01-06"), product: "Laptop", price: 1200, quantity: 1,


customer: "Deva" }

])

Execute Aggregation Operations

1. $avg (Average)

Calculate the average price of each product.

salesDB> db.Sales.aggregate([

$group: {

_id: "$product",

averagePrice: { $avg: "$price" }

]).pretty()

Result:

{ "_id": "Laptop", "averagePrice": 1200 },

{ "_id": "Mouse", "averagePrice": 25 },


{ "_id": "Keyboard", "averagePrice": 45 },

{ "_id": "Monitor", "averagePrice": 300 }

2. $min (Minimum)

Find the minimum price of each product.

salesDB> db.Sales.aggregate([

$group: {

_id: "$product",

minPrice: { $min: "$price" }

]).pretty()

Result:

{ "_id": "Laptop", "minPrice": 1200 },

{ "_id": "Mouse", "minPrice": 25 },

{ "_id": "Keyboard", "minPrice": 45 },

{ "_id": "Monitor", "minPrice": 300 }

3. $max (Maximum)

Find the maximum price of each product.

salesDB> db.Sales.aggregate([
{

$group: {

_id: "$product",

maxPrice: { $max: "$price" }

]).pretty()

Result:

{ "_id": "Laptop", "maxPrice": 1200 },

{ "_id": "Mouse", "maxPrice": 25 },

{ "_id": "Keyboard", "maxPrice": 45 },

{ "_id": "Monitor", "maxPrice": 300 }

4. $push (Push Values to an Array)

Group sales by customer and push each purchased product into an array.

salesDB> db.Sales.aggregate([

$group: {

_id: "$customer",

products: { $push: "$product" }

}
]).pretty()

Result:

{ "_id": "Amar", "products": ["Laptop", "Keyboard"] },

{ "_id": "Babu", "products": ["Laptop", "Monitor"] },

{ "_id": "Chandra", "products": ["Mouse"] },

{ "_id": "Deva", "products": ["Laptop"] }

5. $addToSet (Add Unique Values to an Array)

Group sales by customer and add each unique purchased product to an array.

salesDB> db.Sales.aggregate([

$group: {

_id: "$customer",

uniqueProducts: { $addToSet: "$product" }

]).pretty()

Result:

{ "_id": "Amar", "uniqueProducts": ["Laptop", "Keyboard"] },

{ "_id": "Babu", "uniqueProducts": ["Laptop", "Monitor"] },

{ "_id": "Chandra", "uniqueProducts": ["Mouse"] },


{ "_id": "Deva", "uniqueProducts": ["Laptop"] }

Combining Aggregation Operations

Let’s combine several aggregation operations to get a comprehensive report.

Example: Calculate the total quantity and total sales amount for each product, and
list all customers who purchased each product.

salesDB> db.Sales.aggregate([

$group: {

_id: "$product",

totalQuantity: { $sum: "$quantity" },

totalSales: { $sum: { $multiply: ["$price", "$quantity"] } },

customers: { $addToSet: "$customer" }

]).pretty()

Result:

"_id": "Laptop",

"totalQuantity": 4,

"totalSales": 4800,

"customers": ["Amar", "Babu", "Deva"]


},

"_id": "Mouse",

"totalQuantity": 5,

"totalSales": 125,

"customers": ["Chandra"]

},

"_id": "Keyboard",

"totalQuantity": 3,

"totalSales": 135,

"customers": ["Amar"]

},

"_id": "Monitor",

"totalQuantity": 1,

"totalSales": 300,

"customers": ["Babu"]

By using aggregation operations such as $avg, $min, $max, $push,


and $addToSet, you can perform complex data analysis and transformations on
MongoDB collections. These operations enable you to calculate averages, find
minimum and maximum values, push values into arrays, and create sets of unique
values. The examples provided show how to use these operators to analyze
a Sales collection

Question 6

Aggregation Pipeline and its operations

Execute Aggregation Pipeline and its operations (pipeline must


contain match,group, sort,project, $skip etc.)

Let’s consider a scenario involving a restaurantDB database with


a restaurants collection. Each document in the restaurants collection contains
details about a restaurant, including its name, cuisine, location, and an array of
reviews. Each review includes a rating and a comment. After creating
the restaurantDB database and insert sample documents into
the restaurants collection we will create an aggregation pipeline as shown below.

// Switch to the restaurantDB database

use restaurantDB

// Insert sample documents into the restaurants collection

db.restaurants.insertMany([

name: "Biryani House",

cuisine: "Indian",

location: "Jayanagar",

reviews: [

{ user: "Aarav", rating: 5, comment: "Amazing biryani!" },

{ user: "Bhavana", rating: 4, comment: "Great place!" }


]

},

name: "Burger Joint",

cuisine: "American",

location: "Koramangala",

reviews: [

{ user: "Chirag", rating: 3, comment: "Average burger" },

{ user: "Devika", rating: 4, comment: "Good value" }

},

name: "Pasta House",

cuisine: "Italian",

location: "Rajajinagar",

reviews: [

{ user: "Esha", rating: 5, comment: "Delicious pasta!" },

{ user: "Farhan", rating: 4, comment: "Nice ambiance" }

},

name: "Curry Palace",

cuisine: "Indian",
location: "Jayanagar",

reviews: [

{ user: "Gaurav", rating: 4, comment: "Spicy and tasty!" },

{ user: "Harini", rating: 5, comment: "Best curry in town!" }

},

name: "Taco Stand",

cuisine: "Mexican",

location: "Jayanagar",

reviews: [

{ user: "Ishaan", rating: 5, comment: "Fantastic tacos!" },

{ user: "Jaya", rating: 4, comment: "Very authentic" }

])

// Run the aggregation pipeline query to display reviews summary

db.restaurants.aggregate([

$match: {

location: "Jayanagar"

}
},

$unwind: "$reviews"

},

$group: {

_id: "$name",

averageRating: { $avg: "$reviews.rating" },

totalReviews: { $sum: 1 }

},

$sort: {

averageRating: -1

},

$project: {

_id: 0,

restaurant: "$_id",

averageRating: 1,

totalReviews: 1

}
},

$skip: 1

]).pretty()

Now, let’s execute an aggregation pipeline that includes


the $match, $unwind, $group, $sort, $project, and $skip stages.

Aggregation Pipeline Explanation

1. $match: Filter restaurants by cuisine ("Jayanagar" location).

2. $unwind: Deconstruct the reviews array from each document to output a


document for each review.

3. $group: Group the documents by restaurant name and calculate the average
rating and total number of reviews.

4. $sort: Sort the results by average rating in descending order.

5. $project: Restructure the output to include only the restaurant name,


average rating, and total reviews.

6. $skip: Skip the first document.

Question 7

a. Find all listings

Find all listings with listing_url, name, address, host_picture_url in the


listings And Reviews collection that have a host with a picture url

To find all listings with listing_url, name, address, and host_picture_url in


the listingsAndReviews collection where the host has a picture URL, let is create
appropriate databases and queries as follows.
Create the Database

First, switch to or create the database you want to use. For this example, let’s call
the database vacationRentals.

test> use vacationRentals

switched to db vacationRentals

vacationRentals>

Create the listingsAndReviews Collection and Insert Documents

Next, create the listingsAndReviews collection and insert sample documents. Here
are a few example documents to illustrate the structure:

vacationRentals> db.listingsAndReviews.insertMany([

listing_url: "http://www.example.com/listing/123456",

name: "Beautiful Apartment",

address: {

street: "123 Main Street",

suburb: "Central",

city: "Metropolis",

country: "Wonderland"

},

host: {

name: "Alice",

picture_url: "http://www.example.com/images/host/host123.jpg"

}
},

listing_url: "http://www.example.com/listing/654321",

name: "Cozy Cottage",

address: {

street: "456 Another St",

suburb: "North",

city: "Smallville",

country: "Wonderland"

},

host: {

name: "Bob",

picture_url: ""

},

listing_url: "http://www.example.com/listing/789012",

name: "Modern Condo",

address: {

street: "789 Side Road",

suburb: "East",

city: "Gotham",

country: "Wonderland"
},

host: {

name: "Charlie",

picture_url: "http://www.example.com/images/host/host789.jpg"

])

Query to Find Listings with Host Picture URLs

Now that the collection is set up, you can run the query to find all listings
with listing_url, name, address, and host_picture_url where the host has a
picture URL.

db.listingsAndReviews.find(

"host.picture_url": { $exists: true, $ne: "" }

},

listing_url: 1,

name: 1,

address: 1,

"host.picture_url": 1

).pretty()

Explanation:
 Query Filter:

 "host.picture_url": { $exists: true, $ne: "" }: This part of the query


ensures that only documents where the host.picture_url field exists
and is not an empty string are selected.

 Projection:

 { listing_url: 1, name: 1, address: 1, "host.picture_url": 1 }: This


part of the query specifies the fields to include in the output.
The 1 indicates that these fields should be included.

Expected Result

The query should return documents where the host has a picture URL. Based on
the inserted documents, the result should look something like this:

"_id": ObjectId("..."),

"listing_url": "http://www.example.com/listing/123456",

"name": "Beautiful Apartment",

"address": {

"street": "123 Main Street",

"suburb": "Central",

"city": "Metropolis",

"country": "Wonderland"

},

"host": {

"picture_url": "http://www.example.com/images/host/host123.jpg"

}
}

"_id": ObjectId("..."),

"listing_url": "http://www.example.com/listing/789012",

"name": "Modern Condo",

"address": {

"street": "789 Side Road",

"suburb": "East",

"city": "Gotham",

"country": "Wonderland"

},

"host": {

"picture_url": "http://www.example.com/images/host/host789.jpg"

b. E-commerce collection

Using E-commerce collection write a query to display reviews summary.

To display a summary of reviews in an e-commerce collection, we can assume


the ecommerce database contains a products collection with documents structured
to include reviews. Each product document could have a reviews array with review
details such as rating, comment, and user.

// Switch to the ecommerce database

use ecommerce
// Insert sample documents into the products collection

db.products.insertMany([

product_id: 1,

name: "Laptop",

category: "Electronics",

price: 1200,

reviews: [

{ user: "Alice", rating: 5, comment: "Excellent!" },

{ user: "Bob", rating: 4, comment: "Very good" },

{ user: "Charlie", rating: 3, comment: "Average" }

},

product_id: 2,

name: "Smartphone",

category: "Electronics",

price: 800,

reviews: [

{ user: "Dave", rating: 4, comment: "Good phone" },

{ user: "Eve", rating: 2, comment: "Not satisfied" },

{ user: "Frank", rating: 5, comment: "Amazing!" }

]
},

product_id: 3,

name: "Headphones",

category: "Accessories",

price: 150,

reviews: [

{ user: "Grace", rating: 5, comment: "Great sound" },

{ user: "Heidi", rating: 3, comment: "Okay" }

])

// Run the aggregation query to display reviews summary

db.products.aggregate([

$unwind: "$reviews"

},

$group: {

_id: "$name",

totalReviews: { $sum: 1 },

averageRating: { $avg: "$reviews.rating" },


comments: { $push: "$reviews.comment" }

},

$project: {

_id: 0,

product: "$_id",

totalReviews: 1,

averageRating: 1,

comments: 1

]).pretty()

This script will set up the ecommerce database, populate the products collection
with sample data, and execute an aggregation query to summarize the reviews.

Explanation:

1. $unwind: Deconstructs the reviews array from each document to output a


document for each element.

2. $group: Groups the documents by product name, and calculates:

 totalReviews: The total number of reviews for each product.

 averageRating: The average rating of the reviews for each product.

 comments: An array of all review comments for each product.

3. $project: Restructures the output documents to include the product name,


total reviews, average rating, and comments.
Sample Output:

The query will return a summary for each product in the collection:

"product": "Laptop",

"totalReviews": 3,

"averageRating": 4,

"comments": [

"Excellent!",

"Very good",

"Average"

},

"product": "Smartphone",

"totalReviews": 3,

"averageRating": 3.6666666666666665,

"comments": [

"Good phone",

"Not satisfied",

"Amazing!"

},
{

"product": "Headphones",

"totalReviews": 2,

"averageRating": 4,

"comments": [

"Great sound",

"Okay"

Question 8

a. Demonstrate different types of indexes

Demonstrate creation of different types of indexes on collection (unique,


sparse, compound and multikey indexes)

Let’s demonstrate the creation of various types of indexes on


a restaurants collection in the restaurantDB database. We’ll cover unique,
sparse, compound, and multikey indexes.

Step 1: Create the Database and Collection

First, let’s set up the restaurantDB database and insert sample documents into
the restaurants collection.

// Switch to the restaurantDB databasenuse restaurantDBnn// Insert sample


documents into the restaurants collectionndb.restaurants.insertMany([n {n name:
u0022Biryani Houseu0022,n cuisine: u0022Indianu0022,n location:
u0022Downtownu0022,n reviews: [n { user: u0022Aaravu0022, rating: 5,
comment: u0022Amazing biryani!u0022 },n { user: u0022Bhavanau0022,
rating: 4, comment: u0022Great place!u0022 }n ],n contact: { phone:
u00221234567890u0022, email: [email protected] }n },n
{n name: u0022Curry Palaceu0022,n cuisine: u0022Indianu0022,n location:
u0022Downtownu0022,n reviews: [n { user: u0022Gauravu0022, rating: 4,
comment: u0022Spicy and tasty!u0022 },n { user: u0022Hariniu0022, rating: 5,
comment: u0022Best curry in town!u0022 }n ],n contact: { phone:
u00220987654321u0022, email: [email protected] }n },n {n
name: u0022Taco Standu0022,n cuisine: u0022Mexicanu0022,n location:
u0022Downtownu0022,n reviews: [n { user: u0022Ishaanu0022, rating: 5,
comment: u0022Fantastic tacos!u0022 },n { user: u0022Jayau0022, rating: 4,
comment: u0022Very authenticu0022 }n ],n contact: { phone:
u00221122334455u0022, email: [email protected] }n }n])

Step 2: Create Various Indexes

1. Unique Index

A unique index ensures that the indexed field does not contain duplicate values.

// Create a unique index on the contact.email


fieldndb.restaurants.createIndex({ u0022contact.emailu0022: 1 }, { unique: true })

2. Sparse Index

A sparse index only includes documents that contain the indexed field, ignoring
documents where the field is missing.

// Create a sparse index on the location fieldndb.restaurants.createIndex({ location:


1 }, { sparse: true })

3. Compound Index

A compound index indexes multiple fields within a single index.

// Create a compound index on the name and location


fieldsndb.restaurants.createIndex({ name: 1, location: 1 })

4. Multikey Index

A multikey index is created on an array field, indexing each element of the array.
// Create a multikey index on the reviews
fieldndb.restaurants.createIndex({ reviews: 1 })

Step 3: Verify Indexes

To verify the created indexes, you can use the getIndexes method.

// Verify the created indexesndb.restaurants.getIndexes()

Output

testu003e use restaurantDBnswitched to db restaurantDBnrestaurantDBu003e


db.restaurants.insertMany([n... {n... name: u0022Biryani Houseu0022,n...
cuisine: u0022Indianu0022,n... location: u0022Downtownu0022,n... reviews:
[n... { user: u0022Aaravu0022, rating: 5, comment: u0022Amazing biryani!
u0022 },n... { user: u0022Bhavanau0022, rating: 4, comment: u0022Great
place!u0022 }n... ],n... contact: { phone: u00221234567890u0022, email:
[email protected] }n... },n... {n... name: u0022Curry
Palaceu0022,n... cuisine: u0022Indianu0022,n... location:
u0022Downtownu0022,n... reviews: [n... { user: u0022Gauravu0022, rating:
4, comment: u0022Spicy and tasty!u0022 },n... { user: u0022Hariniu0022,
rating: 5, comment: u0022Best curry in town!u0022 }n... ],n... contact:
{ phone: u00220987654321u0022, email:
[email protected] }n... },n... {n... name: u0022Taco
Standu0022,n... cuisine: u0022Mexicanu0022,n... location:
u0022Downtownu0022,n... reviews: [n... { user: u0022Ishaanu0022, rating:
5, comment: u0022Fantastic tacos!u0022 },n... { user: u0022Jayau0022, rating:
4, comment: u0022Very authenticu0022 }n... ],n... contact: { phone:
u00221122334455u0022, email:
[email protected] }n... }n... ])n{n acknowledged: true,n
insertedIds: {n '0': ObjectId('667b3c596809bfbfae149f48'),n '1':
ObjectId('667b3c596809bfbfae149f49'),n '2':
ObjectId('667b3c596809bfbfae149f4a')n }n}nrestaurantDBu003e
db.restaurants.createIndex({ u0022contact.emailu0022: 1 }, { unique:
true })ncontact.email_1nrestaurantDBu003e db.restaurants.createIndex({ location:
1 }, { sparse: true })nlocation_1nrestaurantDBu003e
db.restaurants.createIndex({ name: 1, location:
1 })nname_1_location_1nrestaurantDBu003e db.restaurants.createIndex({ reviews:
1 })nreviews_1nrestaurantDBu003e db.restaurants.getIndexes()n[n { v: 2, key:
{ _id: 1 }, name: '_id_' },n {n v: 2,n key: { 'contact.email': 1 },n name:
'contact.email_1',n unique: truen },n { v: 2, key: { location: 1 }, name:
'location_1', sparse: true },n { v: 2, key: { name: 1, location: 1 }, name:
'name_1_location_1' },n { v: 2, key: { reviews: 1 }, name:
'reviews_1' }n]nrestaurantDBu003e n

This script sets up the restaurantDB database, populates


the restaurants collection with sample data, and demonstrates the creation of
unique, sparse, compound, and multikey indexes. The getIndexes method at the
end allows you to verify the indexes created on the collection.

b. Demonstrate optimization of queries using indexes.

To demonstrate the optimization of queries using indexes, we have to use a fairly


large database where query execution times are longer. For the following
examples, we’ll use a dataset of daily NASDAQ summaries. To follow along,
you’ll need this data locally.

Step 1: Create the Database and Collection

First, download the archive from here . Then, unzip the file to a temporary folder.

$ unzip stocks.zip nArchive: stocks.zipn creating: dump/stocks/n inflating:


dump/stocks/system.indexes.bson n inflating: dump/stocks/values.bson

Now its time to import this stocks database into the MongoDB using
the mongorestore command. After that switch to the stocks database.

$ mongorestore -d stocks dump/stocksnn$ mongoshnntestu003e use


stocksnswitched to db stocks

Lets have a look at the structure of this database to find the various fields using the
following command. The stocks database has a value collection that contains, for a
certain subset of the NASDAQ stock exchange’s symbols, there’s a document for
each day’s high, low, close, and volume for a 25-year period beginning in 1983.
stocksu003e show collectionsnvaluesnnkeys = db.values.findOne()n{n _id:
ObjectId('4d094f58c96767d7a0099d49'),n exchange: 'NASDAQ',n stock_symbol:
'AACC',n date: '2008-03-07',n open: 8.4,n high: 8.75,n low: 8.08,n close: 8.55,n
volume: 275800,n 'adj close': 8.55n}nnstocksu003e
db.values.countDocuments()n4308303n

You can also see that this database has more than four million records/documents
and a huge amount of information in it. Queries run on such databases usually take
more time to execute. For example if we want to find out the first occurrence of
Google’s stock price, we issue the following query.

stocks> db.values.find({"stock_symbol": "GOOG"}).sort({date: -1}).limit(1)

_id: ObjectId('4d094f7ec96767d7a02a0af6'),

exchange: 'NASDAQ',

stock_symbol: 'GOOG',

date: '2008-03-07',

open: 428.88,

high: 440,

low: 426.24,

close: 433.35,

volume: 8071800,

'adj close': 433.35

]
We observe that this query takes time to run. Lets see if we can actually measure
the time taken. Fortunately we have a explain() method. MongoDB’s explain
command provides detailed information about a given query execution details. It
provides even more detail with the executionStats parameter. Let us see an
example

Step 2: Issue a slow query

Let us develop a query to finding the highest closing price in the data set.

stocks> db.values.find({}).sort({close: -1}).limit(1)

When we execute this query you observe it takes more time to execute and produce
the following result.

_id: ObjectId('4d094fc2c96767d7a0360a64'),

exchange: 'NASDAQ',

stock_symbol: 'BORD',

date: '2000-09-20',

open: 7500,

high: 7500,

low: 7500,

close: 7500,

volume: 400,

'adj close': 6679.94

]
Step 3: Execution statistics

To obtain execution statistics we need to append the explain method. To get more
details pass executionStats to the explain method.

stocks> db.values.find({}).sort({close: -1}).limit(1).explain()

stocks> db.values.find({}).sort({close: -1}).limit(1).explain("executionStats")

You can see from its output that it provides us with a wealth of information
regarding the query. But what we are interested is in the execution time and
number of documents it has scanned to obtain the result. These information can be
obtained as follows.

stocks> db.values.find({}).sort({close: -
1}).limit(1).explain("executionStats").executionStats.totalDocsExamined

4308303

stocks> db.values.find({}).sort({close: -
1}).limit(1).explain("executionStats").executionStats.executionTimeMillis

1831

We see that it has scanned all the documents(4308303) to arrive at the result and it
has taken 1.8 seconds. The reason for this performance is that the database is not
indexed. Let us now index and see the performance.

Step 2: Create a Index and optimize query performance

We will add an index on the close field.

stocks> db.values.createIndex({close: 1})

close_1

We will rerun the queries now and examine the query performance.
stocks> db.values.find({}).sort({close: -
1}).limit(1).explain("executionStats").executionStats.totalDocsExamined

stocks> db.values.find({}).sort({close: -
1}).limit(1).explain("executionStats").executionStats.executionTimeMillis

57

Now after the database is indexed we see a huge improvement in performance. The
no of records scanned is just 1 and query took only 57 milliseconds to execute.

Question 9

a. Develop a query to demonstrate Text search using catalog data collection


for a given word.

To demonstrate text search in MongoDB using a catalog collection, we’ll follow


these steps:

1. Create the catalog collection and insert sample documents.

2. Create a text index on the relevant fields.

3. Perform a text search query to find documents containing a specific


word.

For this example let us consider a movie database that has been imported from a
CSV file. We can import data from the CSV file using the mongoimport utility as
follows:

The CSV file kan_movies.csv is provided below for your reference.

kan_moviesDownload

Step 1: Importing from CSV file using mongoimport into


the catalog Collection
mongoimport --db=kannadaMoviesDB --collection='catalog' --
file=kan_movies.csv --type=csv --
fields="name","year","duration","rating","genre","lang"

 –db parameter is used to specify the database into which data is to be


imported.

 –file parameter is used to specify the file from which data is to be imported

 –type parameter is used to specify the file type (csv, json,…..)

 –collection parameter is used to specify the collection into which data is to


be imported.

 –fields parameter is used to specify a list of strings that are field names in
the collection.

On success you should get the following output.

2024-06-29T02:53:01.473+0530 connected to: mongodb://localhost/

2024-06-29T02:53:02.252+0530 701 document(s) imported successfully. 0


document(s) failed to import.

Now launch MongoDB and choose the newly created database as follows:

test> use kannadaMoviesDB

switched to db kannadaMoviesDB

kannadaMoviesDB> show collections

catalog

kannadaMoviesDB> db.catalog.countDocuments()

701
Alternatively you can create a catalog collection by adding documents
using insertMany query as done in previous exercises.

Step 2: Create a Text Index

Next, create a text index on the name and genre fields to enable text search.

// Create a text index on the name and genre fields

db.catalog.createIndex({name: "text", genre: "text"})

Step 3: Perform a Text Search Query

Now, let’s perform a text search to find documents containing a specific word. For
example, let’s search for the word “maga”.

// Perform a text search query to find documents containing the word "maga"

db.catalog.find({$text: {$search: "maga"}})

Output

_id: ObjectId('667f29b50c118ded9b39bd44'),

name: 'Jayammana Maga',

year: 2013,

duration: '139 min',

rating: 7.1,

genre: 'Drama',

lang: 'kannada'

},

{
_id: ObjectId('667f29b50c118ded9b39bc7b'),

name: 'Rajannana Maga',

year: 2018,

duration: '143 min',

rating: 7.8,

genre: 'Action',

lang: 'kannada'

},

_id: ObjectId('667f29b50c118ded9b39beea'),

name: 'Thayige thakka maga',

year: 2018,

duration: '147 min',

rating: 5.4,

genre: 'Drama',

lang: 'kannada'

},

_id: ObjectId('667f29b50c118ded9b39bd97'),

name: 'Bhootayyana Maga Ayyu',

year: 1974,

duration: '155 min',

rating: 8.3,
genre: 'Drama',

lang: 'kannada'

},

_id: ObjectId('667f29b50c118ded9b39bd31'),

name: 'Jaga Mechida Maga',

year: 1972,

duration: '153 min',

rating: 8.2,

genre: 'Drama',

lang: 'kannada'

},

_id: ObjectId('667f29b50c118ded9b39bd11'),

name: 'Daari Tappida Maga',

year: 1975,

duration: '138 min',

rating: 7.6,

genre: 'Crime, Drama ',

lang: 'kannada'

// Perform a text search query to find documents containing the word "raju"
db.catalog.find({$text: {$search: "raju"}})

Output

_id: ObjectId('667f29b50c118ded9b39bd33'),

name: 'Raju Kannada Medium',

year: 2018,

duration: '159 min',

rating: 7.2,

genre: 'Comedy, Drama ',

lang: 'kannada'

},

_id: ObjectId('667f29b50c118ded9b39bcd0'),

name: 'First Rank Raju',

year: 2015,

duration: '148 min',

rating: 7.8,

genre: 'Comedy, Drama ',

lang: 'kannada'

Step 4: Perform a Text Search Query for a phrase


Now, let’s perform a text search to find documents containing a specific phrase.
For example, let’s search for the phrase “tappida Maga”.

// Perform a text search query to find documents containing the phrase "maga"

db.catalog.find({$text: {$search: ""tappida Maga""}})

Output

_id: ObjectId('667f29b50c118ded9b39bd11'),

name: 'Daari Tappida Maga',

year: 1975,

duration: '138 min',

rating: 7.6,

genre: 'Crime, Drama ',

lang: 'kannada'

Explanation

1. Inserting Documents: We insert several documents into


the catalog collection with fields name,year,duration,rating,genre,
and lang.

2. Creating a Text Index: We create a text index on


the name and description fields to enable text search.

3. Performing a Text Search: We use the $text operator with


the $search parameter to find documents that contain the word “raju” in
either the name or genre fields.
This script sets up the catalog collection, creates a text index, and demonstrates a
text search query to find documents containing a specific word.

b. Develop queries to illustrate excluding documents with certain words and


phrases

To exclude documents containing certain words or phrases in MongoDB, you can


use the $text operator combined with the $search parameter and the negation (-)
operator. This allows you to perform text searches that exclude specific terms.

A negated term is a term that is prefixed by a minus sign -. If you negate a term,
the $text operator excludes the documents that contain those terms from the
results.

Step-by-Step Process

1. Set up the catalog collection: Insert sample documents.

2. Create a text index: Enable text search.

3. Perform queries to exclude documents: Use the $text operator with


negation.

Step 1: Create a catalog collection

For this we will use the same catalog collection from our previous example. You
can follow the same steps as earlier to create the collection.

Step 2: Create a Text Index

Create a text index on the name and description fields.

// Create a text index on the name and description fields

db.catalog.createIndex({ name: "text", description: "text" })

Step 3: Perform Queries to Exclude Documents

Use the $text operator with negation to exclude documents containing specific
words or phrases.

Example 1: Exclude Documents Containing the Word “action”


Suppose we want to list movies that belong to crime or romance (or both) genre
but not belonging to the action genre. Since this will yield too many results we will
restrict the search to the year 2021.

// Exclude documents containing the word "action"

db.catalog.find({ $text: { $search: "crime romance -action" }, year:2021 } )

Output

_id: ObjectId('667f29b50c118ded9b39bcaa'),

name: 'Raktha Gulabi',

year: 2021,

duration: '132 min',

rating: 9.4,

genre: 'Crime',

lang: 'kannada'

},

_id: ObjectId('667f29b50c118ded9b39bd82'),

name: 'Laddu',

year: 2021,

duration: '127 min',

rating: 8.6,

genre: 'Comedy, Romance ',


lang: 'kannada'

Example 2: Exclude Documents Containing the Phrase “da maga”

We display those documents that have the word maga but not the phrase da maga.

// Exclude documents containing the phrase "da maga"

db.catalog.find({$text: {$search: "maga -"da maga""}})

Output

_id: ObjectId('667f29b50c118ded9b39bd44'),

name: 'Jayammana Maga',

year: 2013,

duration: '139 min',

rating: 7.1,

genre: 'Drama',

lang: 'kannada'

},

_id: ObjectId('667f29b50c118ded9b39bc7b'),

name: 'Rajannana Maga',

year: 2018,

duration: '143 min',


rating: 7.8,

genre: 'Action',

lang: 'kannada'

},

_id: ObjectId('667f29b50c118ded9b39beea'),

name: 'Thayige thakka maga',

year: 2018,

duration: '147 min',

rating: 5.4,

genre: 'Drama',

lang: 'kannada'

},

_id: ObjectId('667f29b50c118ded9b39bd97'),

name: 'Bhootayyana Maga Ayyu',

year: 1974,

duration: '155 min',

rating: 8.3,

genre: 'Drama',

lang: 'kannada'

]
Note:

 The negated word excludes documents that contain the negated word from
the result set.

 When passed a string that only contains negated words, $text does not
match any documents.

Question 10

Aggregation Pipeline to illustrate Text search on Catalog data collection

Develop an aggregation pipeline to illustrate Text search on Catalog data


collection.

The aggregation framework in MongoDB is a powerful tool for data processing


and transformation. It consists of a series of stages, each stage performing an
operation on the input documents and passing the results to the next stage. This
sequence of operations is called an aggregation pipeline.

Here is a step-by-step guide to using the aggregation pipeline in MongoDB, with


examples of various stages such as $match, $group, $sort, $project, $skip, and
others.

Example Aggregation Pipeline

Let’s create an aggregation pipeline that includes various stages:

1. $match: Filter documents to include only those in the year 2017.

2. $group: Group documents by genre and calculate the average rating for
each genre.

3. $sort: Sort the results by avgRating in descending order.

4. $project: Include specific fields in the output

5. $limit: Limit the output to 5 results.

6. $skip: Skip the first two results.


result = db.catalog.aggregate([

// 1. Match stage: Filter documents by year 2017

{$match:{year :2017}},

// 2. Group stage: Group by genre and calculate average rating

{$group:{_id: "$genre", avgRating:{$avg: "$rating"}}},

// 3. Sort stage: Sort by avgRating in descending order

{$sort: {avgRating:-1}},

// 4. Project stage: Include specific fields

{$project:{year:"$year", avgRating:1, genre:1} },

// 5. Limit stage: Limit the output to 5 results

{$limit:5} ]).toArray()

print("Top 5 rated movie genres with their average rating")

printjson(result)

Output

Top 5 rated movie genres with their average rating

_id: 'Mystery',

avgRating: 7.9

},
{

_id: 'Comedy, Drama ',

avgRating: 7.9

},

_id: 'Adventure, Family ',

avgRating: 7.8

},

_id: 'Drama, Romance, Thriller ',

avgRating: 7.7

},

_id: 'Drama',

avgRating: 7.616666666666667

Adding another stage using $skip

To find the remaining five genres among the top-rated seven genres after skipping
the first two, we can use the $skip stage in the aggregation pipeline.

result2 = db.catalog.aggregate([

// 1. Match stage: Filter documents by year 2017

{$match:{year :2017}},
// 2. Group stage: Group by genre and calculate average rating

{$group:{_id: "$genre", avgRating:{$avg: "$rating"}}},

// 3. Sort stage: Sort by avgRating in descending order

{$sort: {avgRating:-1}},

// 4. Project stage: Include specific fields

{$project:{year:"$year", avgRating:1, genre:1} },

// 5. Limit stage: Limit the output to 7 results

{$limit:7},

// 6. Skip stage: Skip the first two results

{$skip:2} ]).toArray()

print("Top 7 rated movie genres with their average rating with he firsttwo
skipped")

printjson(result2)

Output

Top 7 rated movie genres with their average rating with he firsttwo skipped

kannadaMoviesDB> printjson(result2)

_id: 'Adventure, Family ',

avgRating: 7.8

},
{

_id: 'Drama, Romance, Thriller ',

avgRating: 7.7

},

_id: 'Drama',

avgRating: 7.616666666666667

},

_id: 'Drama, History, Musical ',

avgRating: 7.5

},

_id: 'Drama, Family, History ',

avgRating: 7.5

The aggregation pipeline in MongoDB provides a flexible and powerful


framework for data processing. By chaining multiple stages, you can filter, group,
sort, project, and transform your data to suit your specific requirements.

You might also like