0% found this document useful (0 votes)

14 views30 pages

Open Elective Project

The document outlines a project titled 'Desktop Assistant Using Python,' submitted by students of Amity University as part of their Bachelor of Computer Applications degree. It details the project's purpose, which is to create a voice-activated desktop assistant that can perform various tasks using speech recognition and natural language processing. The document also includes acknowledgments, project design, implementation methodology, and the necessary hardware and software requirements.

Uploaded by

Vicky Singh

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

14 views30 pages

Open Elective Project

Uploaded by

Vicky Singh

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

You are on page 1/ 30

OPEN ELECTIVE PROJECT

DESKTOP ASSISTANT USING PYTHON

Submitted in a partial fulfillment of the requirement for the award of the degree.
Of
BACHELOR OF COMPUTER APPLICATION

Submitted By
MIHIR (A50504920005)
SHAGUN(A50504820016)
ISHAN (A50504820008)

Under the guidance of

DR. SHALINI BHASKAR BAJAJ

Professor, ASET/AIIT

Amity School of Engineering and Technology

Amity University Haryana
Gurugram, India

1
AMITY INSTITUTE OF INFORMATION TECHNOLOGY

DECLARATION

I, Ishan Singh, Enrolment No. A50504820008, Mihir Chaudhery Enrolment

No. A50504920005 student of BSC(It) and Shagun Gowthwal Enrolment No
A50504820016 hereby declare that the project entitled ‘DESKTOP
ASSISTANT’ which is submitted by meto Amity Institute of Information
Technology, Amity University, Haryana, in partial fulfilment of the
requirement for the award of the degree of Bachelor of Computer Applications,
has not been previously formed the basis for the award of any degree, diploma
or other similar title or recognition.

Ishan Singh

(A50504820008)

Mihir Chaudhery

(A50504920005)

Shagun Gowthwal

(A50504820016)

2
Department of Computer Science and Engineering Amity Institute of
Information Technology

CERTIFICATE

This is to certify that Ishan Singh (Enrolment No:A50504820008) student of BCA (VI
Semester), CSE, AIIT, Amity University Haryana, has done his major project entitled
‘DESKTOP ASSISTANT’ under the guidance and supervision of the supervisor DR
SHALNI BHASKAR BAJAJ Professor ASET, AIIT during the ‘Jan- May 2023’. The
work was satisfactory he has shown complete dedication and devotion to the given
project.

D.R SHALNI BHASKAR BAJAJ

PROFFESOR
CSE, ASET

3
ACKNOWLEDGEMENT

Acknowledgement is not a mere obligation but epitome of humility and ineptness to all
those who have helped in the completion of this project. I am thankful to DR SHALNI
BHASKAR BAJAJ, ASET for her constant guidance and encouragement provided in this
endeavor. I also thank my parents for their continued support, understanding and patience
without whose support and understanding this endeavor would never been fruitful. I also
thank all my friends for helpingme out in completing this project and helping me in solving
various problems encountered during the progress of this project

4
CONTENTS

1. Introduction

1.1 Project Introduction.............................................................................06

1.2 Project Description...............................................................................06

2. Background Study

I History..........................................................................................................................07

II Literature Survey........................................................................................................07

3. Design of Project

Technologies used...........................................................................................................08

Hardware Connection....................................................................................................08

4. Implementation of Project.............................................................................09

5. Result of Project........................................................................................... 10

6. Future scope of project…………………………………………………….11

7. References......................................................................................................12

5
1. INTRODUCTION

1.1 Project Introduction

Human interaction is rapidly being supplanted by. Performance is one of the key reasons behind
this shift. Rather than progress, technology has undergone a significant transformation. In today's
world, we use technologies like Machine Learning and Neural Networks to teach our machines
to do their jobs on their own or to think like people. With the help of virtual assistants, we may
now communicate with our machines in the modern world.

Companies such as Google, Apple, Microsoft, and others have virtual assistants such as Google
Now, Siri, and Alexa that allow users to operate their machines just by speaking to them. These
types of virtual assistants are beneficial to the elderly, the visually and physically challenged,
children, and others by ensuring that interacting with machines is no longer a challenge. Even
blind persons who are unable to see the computer can communicate with it just through their
voice.

1.2 Project Description

The Desktop voice assistant who helps the end user to communicate with desktop computer with
voice and it also responds to the voice commands of the user. Our Proposed System has
capability to work with and without Internet Connectivity in desktop computer. It is named as
Desktop voice Assistant with Voice Recognition Intelligence, which takes the user input in form
of voice or text and process it and returns the output in various forms like action to be performed
or the search result is dictated to the end.

6
In many cases users can ask their virtual assistants questions, control home automation devices
and media playback, and manage other basic tasks such as email, to-do lists, and calendars - all
with verbal commands. In recent years, prominent virtual assistants for direct consumer use have
included Amazon's Alexa, Apple's Siri, Microsoft's Cortana, and Google Assistant. Also,
companies in various industries often incorporate some kind of virtual assistant technology into
their customer service or support. Recently, the emergence of recent artificial intelligence based
chatbots, such as ChatGPT, has brought increased capability and interest to the field of virtual
assistant products and services.
Desktop voice assistants leverage advanced technologies such as natural language processing,
speech recognition, and artificial intelligence to understand and interpret your voice commands.
They can assist you with a wide range of tasks, including:

Information Retrieval: You can ask the voice assistant to search the web, find answers to your
questions, provide weather updates, and deliver real-time news updates.

Productivity and Organization: Voice assistants can help you manage your calendar, set
reminders, create to-do lists, and send emails or text messages.

System Control: You can use voice commands to control various aspects of your computer, such
as opening applications, adjusting system settings, and launching files or documents.

Entertainment: Voice assistants can play music, podcasts, or audiobooks for you, as well as
provide recommendations for movies, TV shows, or games.

Smart Home Integration: If you have smart home devices, a voice assistant can be used to
control them. For example, you can adjust your thermostat, turn on/off lights, or lock doors using
voice commands.

The specific capabilities and features of a desktop voice assistant depend on the particular
software you are using. Some popular examples of desktop voice assistants include Microsoft
Cortana, Apple Siri, Google Assistant, and Amazon Alexa.

7
2. BACKGROUND STUDY

2.1 History

The idea of artificial intelligence goes back thousands of years, to ancient philosophers
considering questions of life and death. In ancient times, inventors made things called
automatons which were mechanical and moved independently of human intervention. The word
automaton comes from ancient Greek and means acting of one’s own will.

One of the earliest records of an automaton comes from 400 BCE and refers to a mechanical
pigeon created by a friend of the philosopher Plato. Many years later, one of the most famous
automatons was created by Leonardo da Vinci around the year 1495. So while the idea of a
machine being able to function on its own is ancient, for the purposes of this article, we’re going
to focus on the 20th century, when engineers and scientists began to make strides toward our
modern-day AI.

The first natural language processing computer program or the chatbot ELIZA was developed by
MIT professor Joseph Weizenbaum in the 1960s. It was created to "demonstrate that the
communication between man and machine was superficial". ELIZA used pattern matching and
substitution methodology into scripted responses to simulate conversation, which gave an
illusion of understanding on the part of the program.

Weizenbaum's own secretary reportedly asked Weizenbaum to leave the room so that she and
ELIZA could have a real conversation. Weizenbaum was surprised by this, later writing: "I had
not realized ... that extremely short exposures to a relatively simple computer program could
induce powerful delusional thinking in quite normal people.[10] This gave name to the ELIZA
effect, the tendency to unconsciously assume computer behaviors are analogous to human
behaviors; that is, anthropomorphisation, a phenomenon present in human interactions with
virtual assistants. The next milestone in the development of voice recognition technology was
achieved in the 1970s at the Carnegie Mellon University in Pittsburgh, Pennsylvania with
substantial support of the United States Department of Defense and its DARPA agency, funded
five years of a Speech Understanding Research program, aiming to reach a minimum vocabulary

8
of 1,000 words. Companies and academia including IBM, Carnegie Mellon University (CMU)
and Stanford Research Institute took part in the program. The result was "Harpy", it mastered
about 1000 words, the vocabulary of a three-year-old and it could understand sentences. It could
process speech that followed pre-programmed vocabulary, pronunciation, and grammar
structures to determine which sequences of words made sense together, and thus reducing speech
recognition errors. In 1986 Tangora was an upgrade of the Shoebox, it was a voice recognizing
typewriter. Named after the world's fastest typist at the time, it had a vocabulary of 20,000 words
and used prediction to decide the most likely result based on what was said in the past. IBM's
approach was based on a hidden Markov model, which adds statistics to digital signal processing
techniques. The method makes it possible to predict the most likely phonemes to follow a given
phoneme. Still each speaker had to individually train the typewriter to recognize his or her voice,
and pause between each word.

2.2 Literature Survey

A speech synthesizer takes input and produces an audio stream as output. A speech recognizer
on the other hand does the opposite. It takes an audio stream as input and thus turns it into text
transcription. The voice is a signal of infinite information. Direct analysis and synthesizing the
complex voice signal is due to too much information contained in the signal. Therefore the
digital signal processes such as Feature Extraction and Feature Matching are introduced to
represent the voice signal. In this project we directly use speech engine which use Feature
extraction technique
.Our aim to create more and more functionalities which can help human to assist in their daily
life and also reduces their efforts. Design of a compact large vocabulary speech recognition
system that can run efficiently on any laptop devices, accurately and with low latency.

Feature Extraction is a process where relevant information is extracted from the voice signal to
create a compact representation. Various techniques, such as Mel-Frequency Cepstral
Coefficients (MFCCs) or Linear Predictive Coding (LPC), can be applied to capture important
characteristics of the voice signal, including spectral features and temporal variations. These
features serve as a condensed representation of the voice signal that can be analyzed more
efficiently.

9
Feature Matching involves comparing the extracted features from the voice signal with pre-
defined patterns or models. This step is crucial for recognizing and identifying words or phrases
in the input audio stream. Techniques like Hidden Markov Models (HMMs) or Deep Neural
Networks (DNNs) can be employed for accurate and robust pattern matching.

By utilizing these feature extraction and matching techniques, we aim to develop a speech
recognition system that can handle a large vocabulary, accurately transcribe spoken words into
text, and operate with low latency. This system can have a wide range of applications, including
voice assistants, transcription services, voice-controlled devices, and more.

1
0
3. DESIGN

3.1 Technology Used

Python is an interpreted, object-oriented, high-level programming language with dynamic
semantics. Its high-level built in data structures, combined with dynamic typing and dynamic
binding, make it very attractive for Rapid Application Development, as well as for use as a
scripting or glue language to connect existing components together. Python's simple, easy to
learn syntax emphasizes readability and therefore reduces the cost of program maintenance.
Python supports modules and packages, which encourages program modularity and code reuse.
The Python interpreter and the extensive standard library are available in source or binary form
without charge for all major platforms, and can be freely distributed.

One of the notable strengths of Python is its extensive standard library, which provides a wide
range of modules and tools for tasks such as file handling, networking, web development, data
manipulation, and more. Additionally, Python has a thriving ecosystem of third-party libraries,
such as NumPy, Pandas, TensorFlow, Django, and Flask, which further extend its capabilities in
areas like scientific computing, data analysis, machine learning, and web development.

Python supports multiple programming paradigms, including procedural, object-oriented, and

functional programming styles. This flexibility allows developers to choose the most suitable
approach for their projects. Furthermore, Python is platform-independent, meaning it can run on
various operating systems, including Windows, macOS, Linux, and more. Its interpreter-based
execution also enables rapid development and testing cycles.

Python is widely adopted in different fields, including web development, data science, artificial
intelligence, automation, scripting, and system administration. Its simplicity and large
community support make it an ideal language for both beginners and experienced developers.

Overall, Python's combination of simplicity, readability, vast library ecosystem, and cross-
domain applicability makes it a popular and powerful programming language for a wide range of
1
1
applications.

3.2 HARDWARE / SOFTWARE REQUIREMENT

SPECIFICATIONS: SOFTWARE REQUIREMENTS:

Name of component Specification

Operating System window 11, window 10

Language Python 3.11.0

Browser Any of Mozilla, Opera, Chrome etc.

HARDWARE REQUIREMENTS:

Name of component Specification

RAM 128 MB
Processor Intel i5 core 3.2Ghz
Monitor 15" color monitor
Keyboard 122 key

1
2
4. IMPLEMENTATION OF PROJECT

4.1 Methodology
4.1.1 Python Speech Recognition

The device firstly converts speech input from the user into text using python module speech
recognition algorithm. From voice input taken from users, we can obtain texts from specialized
corpora arranged on the research center's computerized network server, that are briefly keep in
the computer system before being transferred to python's module for recognizing speech. Then,
the central processor accepts the similar text and feeds it.

4.1.2 API Call

Term API stands for Application Programming Interface. The API is just a software-based
interface which helps in establishing connection between two different systems at different
locations. In plenty of other phrases, an API seems to be the messenger which sends your request
towards the source location and afterwards returns the response to you.

4.1.3 Python Text-To-Speech using Pyttsx3.

The extent of a systems to speak the provided text aloud is known as text to speech (TTS).
Firstly, written file is transferred to a lexical representation, which is then subsequently
converted to output waveform which can be used to create as sound file using a TTS Engine.
3rd-party authors offer TTS engine in a variety of dialects, languages, and specialist vocabulary.

9
4.1.4 Datetime

The datetime module in Python provides functionalities for working with dates, times, and
durations. It allows developers to manipulate and perform calculations on dates and times,
parse and format date/time strings, and work with time zones.
The datetime module provides several classes, including date, time, datetime, timedelta, and
timezone, which are essential for handling various aspects of date and time-related
operations.The date class represents a date (year, month, day) and allows operations like date
arithmetic, comparisons, and formatting. It enables tasks such as calculating the difference
between two dates or extracting specific components from a given date.
The time class represents a time of day (hour, minute, second, microsecond) and can be used
for tasks like measuring time intervals, comparing times, or formatting time values. The
datetime class combines both date and time information, providing a comprehensive
representation of a specific point in time. It allows for operations like arithmetic, comparisons,
and formatting on date and time values together.
The timedelta class represents a duration or time interval and enables operations such as adding
or subtracting time intervals from date/time objects. It is useful for performing calculations
involving time differences or determining future or past dates based on a given duration.

4.1.5 Web browser

To perform Web Search. This module comes built-in with Python. The web browser module
also provides additional functions for more advanced usage, such as controlling the size and
position of the browser window, specifying a specific browser to use, or checking if a browser
is available. It's important to note that the behaviour of the web browser module depends on
the configuration of the user's system and the default web browser settings. Different operating
systems and browser configurations may have varying effects on how URLs are opened.
Overall, the web browser module in Python simplifies the process of opening web pages in the
default browser, allowing you to automate tasks that involve displaying web content.

10
4.1.6 Multiprocessing
Multiprocessing is a package that supports spawning processes using an API similar to
the threading module. The multiprocessing package offers both local and remote concurrency,
effectively side-stepping the Global Interpreter Lock by using subprocesses instead of threads.
Due to this, the multiprocessing module allows the programmer to fully leverage multiple
processors on a given machine. It is particularly useful for CPU-bound tasks that can benefit
from parallel execution, such as numerical computations, simulations, and data processing
tasks.However, it's important to note that the multiprocessing module may not always be the best
choice for every situation. The overhead of process creation and inter-process communication
can introduce additional complexity and may not be efficient for tasks that involve a lot of
communication or I/O-bound operations. In such cases, other concurrency models like threading
or asynchronous programming may be more suitable.

4.1.7 Selenium
In today's digital world, web browser automation has become an essential skill for many tasks,
ranging from automated testing and web scraping to web application interaction. Python, with
its simplicity and versatility, provides an excellent framework called Selenium for automating
web browsers. In this article, we will explore the power and capabilities of Selenium in Python
and how it can revolutionize your web automation workflows.
Its a powerful tool for controlling web browsers through programs and performing browser
automation. It is functional for all browsers, works on all major OS and its scripts are written
in various languages i.e Python, Java, C#, etc, we will be working with Python. Selenium
Tutorial covers all topics such as – WebDriver, WebElement, Unit Testing with selenium. This
Python Selenium Tutorial covers Selenium from basics to advanced and professional uses.

11
Selenium is an open-source browser automation framework that allows developers to control
web browsers programmatically. It provides a seamless interface to interact with web elements,
simulate user actions, and extract data from web pages. Selenium supports various web
browsers, including Chrome, Firefox, Safari, and Edge, enabling cross-browser automation.

To get started with Selenium in Python, you need to install the Selenium package using pip.
Additionally, you'll need to download the appropriate web driver executable for the browser
you want to automate. Once the dependencies are set up, you can import the Selenium module
and create a web driver instance.

Selenium in Python empowers developers to harness the potential of web browser automation.
Whether you're automating testing scenarios, scraping data from websites, or interacting with
web applications, Selenium provides a robust and flexible framework. Its extensive features,
cross-browser compatibility, and integration with Python's ecosystem make it a go-to solution
for web automation tasks. By mastering Selenium, you can streamline your workflows,
improve efficiency, and unlock new possibilities in the world of web browser automation.

4.1.8 Winshell and Os

The OS module in Python provides functions for interacting with the operating system. OS
comes under Python’s standard utility modules. This module provides a portable way of using
operating system-dependent functionality. The *os* and *os.path* modules include many
functions to interact with the file system.

Python, with its extensive set of libraries and modules, offers a wide range of tools for system
automation. In this article, we will dive into two powerful modules, Winshell and os, that enable
developers to interact with the Windows operating system and perform various system-level
operations. We will explore the capabilities of these modules and understand how they can
simplify and streamline system automation tasks in Python.
12
On the other hand, the os module is a built-in Python module that provides a platform-
independent way to interact with the operating system. Although it is not specific to Windows, it
offers a wide range of functions and methods for performing system-level operations. With the
os module, you can perform file and directory operations like creating, deleting, renaming, and
traversing directories, as well as copying, moving, and deleting files. It also provides
functionalities for process management, allowing you to spawn new processes, get information
about running processes, terminate processes, and manipulate the process environment. The os
module enables you to access and modify environment variables, making it easy to retrieve
system-specific information or set configuration values. Additionally, it includes functions to
perform system-related tasks such as getting the current working directory, changing the working
directory, and executing system commands.

4.1.8 Urllib.request

The urllib.request module in Python is a built-in library that allows developers to interact with
URLs and perform various HTTP operations, such as sending HTTP requests, retrieving data
from web servers, and handling HTTP responses. It provides a high-level interface for making
network requests and simplifies the process of working with URLs. One of the key functions
provided by urllib.request is urlopen(). This function serves as the primary entry point for
making HTTP requests. By providing a URL as input, urlopen() sends an HTTP request to the
specified URL and returns a file-like object representing the response from the server. This
response object can be used to read the response data, handle headers, and access other
properties of the response. In addition to urlopen(), the module also provides a Request class,
which allows developers to construct and customize HTTP requests in a more flexible manner.
With the Request class, you can set headers, add data to the request body, specify request
methods (such as GET, POST, etc.), and handle other request-related parameters. This allows
for more control over the details of the HTTP request being sent
13
4.1.9 PyAutoGUI

PyAutoGUI is a powerful Python module that allows developers to automate GUI (Graphical
User Interface) interactions and perform tasks on the computer through simulated mouse
movements, keyboard input, and screen captures. With PyAutoGUI, developers can automate
repetitive tasks, create GUI testing scripts, and build interactive applications. One of the key
features of PyAutoGUI is its cross-platform compatibility, as it works on Windows, macOS,
and Linux operating systems. It provides a simple and intuitive interface for controlling mouse
movements and clicks, allowing developers to programmatically move the mouse cursor, click
on specific coordinates, and perform drag and drop operations. Similarly, PyAutoGUI provides
keyboard automation functions, enabling developers to send keypresses, type text, and simulate
hotkeys. PyAutoGUI also offers screen capture capabilities, allowing developers to take
screenshots of the screen or specific regions. This feature can be useful for visual testing,
capturing data from applications, or creating demonstrations. Despite its simplicity,
PyAutoGUI is a powerful automation tool with the ability to automate complex tasks.
However, developers should exercise caution when using PyAutoGUI, as it operates at the
level of screen pixels and can have unintended consequences if used incorrectly.

4.1.10 Urllib.request
module defines functions and classes which help in opening URLs (mostly HTTP) in a
complex world — basic and digest authentication, redirections, cookies and more.
The urllib.request module is part of the Python standard library and provides a high-level
interface for making HTTP requests and handling responses. It allows you to interact with
URLs and retrieve data from web resources.

14
Key components of urllib.request:

urlopen(url, data=None, timeout=None, *, cafile=None, capath=None, cadefault=False,

context=None): This function opens a URL and returns a response object. It can be used to
make GET requests and retrieve the response. The optional data parameter is used for making
POST requests with data. timeout sets a timeout for the request.

Request(url, data=None, headers={}, origin_req_host=None, unverifiable=False,

method=None): This class represents an HTTP request. It allows you to specify the URL, data
for POST requests, headers, and other parameters.

urlencode(query, doseq=False, safe='', encoding=None, errors=None, quote_via=quote_plus):

This function encodes a dictionary or sequence of key-value pairs into a URL-encoded string.
It is commonly used to provide data for POST requests.

response.read(): This method reads the content of the response as a byte string.

response.decode(encoding='utf-8', errors='strict'): This method decodes the byte string content

of the response into a Unicode string using the specified encoding.

response.geturl(): This method returns the URL of the response.

response.info(): This method returns an instance of the http.client.HTTPMessage class
containing the response headers.

The urllib.request module provides a convenient way to interact with web resources and
retrieve data using various HTTP methods. It is a powerful tool for performing HTTP requests
and handling responses within Python programs.

15
4.1.11 Psutil

is a Python cross-platform library used to access system details and process utilities. It is used
to keep track of various resources utilization in the system. Usage of resources like CPU,
memory, disks, network, sensors can be monitored. Hence, this library is used for system
monitoring, profiling, limiting process resources, and the management of running processes.
psutil (process and system utilities) is a cross-platform library for retrieving information
on running processes and system utilization (CPU, memory, disks, network, sensors) in
Python. It is useful mainly for system monitoring, profiling and limiting process
resources and management of running processes. It implements many functionalities offered by
classic UNIX command line tools such as ps, top, iotop, lsof, netstat.
The psutil module in Python is a versatile library that provides a convenient and cross-platform
interface for retrieving system-related information and managing processes. With psutil, you
can obtain details about running processes, monitor system utilization, and access various
system metrics.

One of the primary uses of psutil is process management. It allows you to retrieve information
about running processes on your system, such as their process ID, name, status, CPU and
memory usage, and more. You can interact with processes by terminating or suspending them
as needed. This makes it useful for tasks like monitoring and controlling the execution of
processes within your Python programs.

In addition to process management, psutil offers system monitoring capabilities. You can
monitor key system metrics such as CPU utilization, memory usage, disk usage, and network
statistics. It also provides access to sensor information like temperature and fan speed,
allowing you to keep track of system health.

psutil also provides functions to obtain system-related information. You can retrieve details
about the system's hardware configuration, including the number of CPUs, available memory,
16
disk partitions, and network interfaces. It also allows you to access the system's boot time,
providing insights into system uptime.

The psutil module is designed to be cross-platform, supporting various operating systems such
as Windows, macOS, Linux, FreeBSD, OpenBSD, and Solaris. It abstracts the underlying
system-specific details, providing a consistent API across platforms. This makes it easier to
write portable code that works seamlessly on different operating systems.

Overall, psutil is a powerful and convenient library for system monitoring, process
management, and accessing system-related information in Python. Its versatility and cross-
platform support make it a valuable tool for various applications, ranging from system
monitoring and resource management to process automation and performance analysis.

4.1.12 Win32.com.ClientDispatch

psutil (process and system utilities) is a cross-platform library for retrieving information
on running processes and system utilization (CPU, memory, disks, network, sensors) in Python. It
is useful mainly for system monitoring, profiling and limiting process resources and management
of running processes. It implements many functionalities offered by classic UNIX command line
tools such as ps, top, iotop, lsof, netstat, ifconfig, free and others. psutil currently supports the
following platforms.

The win32com.client.Dispatch module is part of the pywin32 library, which is commonly used for
COM (Component Object Model) automation on Windows systems. With
win32com.client.Dispatch, you can create and interact with COM objects from Python, enabling you
to automate various Windows applications and access their functionalities.

17
COM objects are software components that expose a defined interface, allowing other applications to
utilize their capabilities. By using win32com.client.Dispatch, you can connect to COM servers,
instantiate their objects, and control them from your Python code.

Using win32com.client.Dispatch, you can automate applications like Microsoft Excel, Word,
Outlook, and many others. It provides access to their methods, properties, and events, enabling you
to perform tasks such as reading and modifying data, generating reports, sending emails, and more.

To use win32com.client.Dispatch, you need to have the pywin32 package installed on your system.
Once installed, you can import the win32com.client module and use Dispatch to create instances of
COM objects and interact with them using Python.

Overall, win32com.client.Dispatch is a powerful module that facilitates COM automation in Python.

It allows you to leverage the functionalities of various Windows applications and integrate them into
your Python programs, enabling automation and enhancing productivity on Windows systems.

4.1.13 Pycaw

pycaw is a Python library typically used in Programming Style, Reactive

BeutififulSoupProgramming applications. pycaw has no bugs, it has no vulnerabilities, it has build
file available, it has a Permissive License and it has low support. You can install using 'pip install
pycaw' or download it from GitHub, PyPI.

Pycaw, a Python library built on top of the Core Audio Windows API, provides developers with a
robust and user-friendly interface for controlling audio devices and applications in Windows. In
this article, we will explore the capabilities of Pycaw and how it can revolutionize audio control
and manipulation in Python.

18
4.1.13 BeutifulSoup

Beautiful Soup 3 has been replaced by Beautiful Soup 4. Beautiful Soup 3 only works on Python
2.x, but Beautiful Soup 4 also works on Python 3.x. Beautiful Soup 4 is faster, has more features,
and works with third-party parsers like lxml and html5lib.

BeautifulSoup is a popular Python library used for web scraping and parsing HTML and XML
documents. In this summary, we will explore the key features and benefits of BeautifulSoup and
how it simplifies the process of extracting data from web pages.

BeautifulSoup provides a convenient and efficient way to parse HTML and XML documents. It
allows developers to navigate the document structure, extract specific elements, and manipulate the
data within them. With its robust parsing capabilities, BeautifulSoup handles complex HTML and
XML structures, including malformed or poorly structured documents.

One of the main strengths of BeautifulSoup is its ability to extract data from web pages.
Developers can use a variety of methods and techniques to locate specific HTML elements,
such as tags, classes, or attributes, and extract the desired data. Whether it's scraping text,
retrieving links, or fetching images, BeautifulSoup simplifies the extraction process and saves
developers valuable time and effort.

4.1.14 Pynput
Pynput is a Python library that allows developers to monitor and control input devices such as
keyboards and mice. With its powerful features and intuitive interface, Pynput simplifies the
process of capturing and simulating input events, making it an essential tool for creating
interactive applications, automating tasks, and building keyboard and mouse event listeners.
One of the key features of Pynput is its ability to monitor and capture input events in real-time.
Developers can easily listen for keyboard and mouse events, such as key presses, key releases,
mouse movements, and mouse clicks. This enables the creation of interactive applications that
respond to user input, as well as the development of keylogging or mouse tracking systems.

19
Pynput also allows developers to simulate input events, effectively controlling the keyboard
and mouse programmatically. This feature is valuable for automating tasks or creating virtual
input devices. By emulating keystrokes or mouse actions, developers can automate repetitive
actions, interact with applications, or simulate user interactions in testing scenarios. The library
provides a clear and intuitive API, making it accessible to both beginners and experienced
developers. It offers options for fine-grained control over input event handling, allowing for
customization and flexibility. Additionally, Pynput supports multi-threaded applications,
ensuring smooth input event handling without impacting the overall performance. Pynput's
versatility extends to its compatibility with different platforms, including Windows, macOS,
and Linux, making it a cross-platform solution for input device control. This ensures that
developers can leverage its features across various operating systems without needing to
rewrite their code. In conclusion, Pynput is a powerful Python library that simplifies the
monitoring and control of input devices such as keyboards and mice. With its real-time event
capturing, event simulation capabilities, cross-platform support, and user-friendly API, Pynput
empowers developers to create interactive applications, automate tasks, and build input event
listeners with ease.

4.1.15 Pyttsx3

The pyttsx3 module is a Python library that allows developers to integrate Text-to-Speech
(TTS) conversion into their applications. It serves various purposes, including accessibility,
where it aids individuals with visual impairments by converting written text into spoken words.
By utilizing pyttsx3, developers can create applications that provide audio feedback, such as
reading out notifications, emails, or other textual content. Moreover, it finds application in
language learning, as the module can pronounce words or phrases, helping users enhance their
pronunciation and listening skills. Additionally, pyttsx3 can be integrated into assistive
technologies or devices to deliver speech output, enabling users to access information or
interact with applications using voice commands. It also proves beneficial in automated voice
systems, like interactive voice response (IVR) systems, voice assistants, or chatbots, allowing

20
users to interact through spoken responses. Lastly, pyttsx3 can convert written content, such as
articles or blog posts, into audio files, making them accessible as podcasts or audio versions.
Overall, the pyttsx3 module serves as a powerful tool to incorporate Text-to-Speech
functionality in Python projects, enhancing accessibility, user experience, and automation
through speech output.

4.1.15 Pytz
The pytz module in Python is a third-party library that provides powerful functionalities for
working with time zones. Its main purpose is to facilitate time zone conversions, handle
daylight saving time (DST) transitions, and manage datetime calculations accurately.One of the
key uses of pytz is time zone conversions. It offers functions that enable developers to convert
datetime objects between different time zones effortlessly. This capability allows for the
representation and manipulation of timestamps across various regions of the world without
losing accuracy.Another essential feature of pytz is its ability to localize and normalize
datetime objects. When working with naive datetime objects that lack time zone information,
the module allows you to assign them a specific time zone. By localizing these objects, you can
ensure that they are associated with the correct time zone and accurately represent the intended
moment in time. Furthermore, pytz provides methods to normalize localized datetime objects,
ensuring consistent representation across different time zones.

4.1.16 Json
JSON (JavaScript Object Notation) data. JSON is a popular data interchange format used to
represent structured data in a human-readable and machine-readable format.
The json module offers several important features, including JSON serialization and
deserialization. Serialization is the process of converting Python objects into a JSON string,
while deserialization is the process of converting a JSON string back into Python objects. This
is achieved using the json.dumps() function for serialization and json.loads() function for
deserialization. These functions provide a convenient way to exchange data between different
systems or store data in a JSON format.

21
Another common use of the json module is reading and writing JSON files. With the json
module, you can easily read JSON data from a file using the json.load() function, which loads
the JSON data and returns a corresponding Python object. Similarly, the json.dump() function
allows you to write Python objects into a JSON file.
The json module also provides functionalities for handling JSON data within Python. You can
access and manipulate JSON data using Python's dictionary-like syntax, as JSON objects are
represented as dictionaries in Python

Fig -1 – Implementation of Project

22
5. RESULT

1. Personal productivity: Desktop assistants can help users manage their time, schedule
appointments, set reminders, and prioritize tasks.
2. Communication: Desktop assistants can help users manage their email, make calls, and
send messages, making it easier to stay in touch with others.
3. Information retrieval: Desktop assistants can help users find information on the web, such
as news articles, product reviews, and answers to common questions.
4. Entertainment: Desktop assistants can provide access to music, videos, and games,
allowing users to relax and unwind.
5. Automation: Desktop assistants can help automate repetitive or time-consuming tasks,
such as filling out forms or completing online purchases.
6. Device control: Desktop assistants can help users control their computer, such as
adjusting settings or launching applications, through voice commands or other interfaces.
7. Virtual and augmented reality: Desktop assistants can be integrated with virtual and
augmented reality technologies to provide new use cases and applications, such as controlling a
smart home or playing games in virtual reality.

Fig -2 Result

23
5.1 FUTURE SCOPE OF PROJECT

1. Enhanced Natural Language Understanding:

AI desktop assistants will continue to improve their ability to understand and interpret natural
language. They will become more adept at understanding complex queries, handling context, and
accurately responding to user commands.

2. Personalization and Context Awareness:

Future AI desktop assistants will become more personalized and context-aware. They will learn from
user interactions and preferences to provide tailored recommendations, anticipate user needs, and
proactively assist in various tasks.

3. Multimodal Interaction:
AI desktop assistants will evolve to support multimodal interaction, combining voice commands,
text input, gestures, and even visual cues. This will enable more natural and intuitive interactions
with the assistant, making it more versatile and user-friendly.

4.Integration with IoT Devices:

AI desktop assistants will seamlessly integrate with Internet of Things (IoT) devices, allowing users
to control and manage their smart homes, appliances, and other connected devices using voice
commands or other forms of interaction.

5.Advanced Task Automation:

AI desktop assistants will continue to automate repetitive tasks, such as scheduling meetings,
managing emails, organizing files, and more. They will become more intelligent in understanding
user preferences and proactively performing tasks on their behalf.

6.Voice and Emotion Recognition:

AI desktop assistants will improve their ability to recognize different voices and distinguish between
multiple users. Additionally, they will develop the capability to identify emotions from voice cues,
allowing for more empathetic and personalized interactions.

7.Deeper Integration with Applications:

AI desktop assistants will integrate more deeply with various applications, enabling seamless control
and interaction. They will have the ability to access and manipulate data from different applications,
perform complex tasks, and provide intelligent insights and recommendations.

8.Privacy and Security:

As AI desktop assistants handle sensitive information and interact with personal data, future
developments will focus on strengthening privacy and security measures. This includes robust data
encryption, secure authentication mechanisms, and user-controlled data access.

24
9.Adaptive Learning and Continuous Improvement:
AI desktop assistants will employ adaptive learning techniques to improve their performance over
time. They will learn from user feedback, analyze user interactions, and continuously refine their
capabilities and responses.

10.Collaboration and Integration with Humans:

AI desktop assistants will not only work autonomously but also collaborate with humans in a more
seamless manner. They will assist in team collaborations, support virtual meetings, and facilitate
information sharing among users.

25
6. REFRENCES

 http://www.ijstr.org/final-print/dec2019/Advanced-Desktop-Assistant-With-Voice-
Recognition-Security-And-Voice-Email-Capability-Using-Aritificial-Intelligence.pdf
 http://www.python.com
 https://www.geeksforgeeks.org/voice-assistant-using-python/
 https://pypi.org/project/pyttsx3/

Final 1
No ratings yet
Final 1
39 pages
Desktop Voice Assistant
No ratings yet
Desktop Voice Assistant
4 pages
Jarvis Synopsis
No ratings yet
Jarvis Synopsis
18 pages
Athena Research Paper
No ratings yet
Athena Research Paper
5 pages
Final Report R
No ratings yet
Final Report R
85 pages
Virtual Assistant Using Python Ijariie18581
No ratings yet
Virtual Assistant Using Python Ijariie18581
4 pages
JETIRGH06016
No ratings yet
JETIRGH06016
6 pages
DesktopAssistant Reoprt
No ratings yet
DesktopAssistant Reoprt
42 pages
Miniproject Synopsis
No ratings yet
Miniproject Synopsis
7 pages
Virtual Assistance Using Python
No ratings yet
Virtual Assistance Using Python
5 pages
Reportt
No ratings yet
Reportt
19 pages
Machine Learning Project
No ratings yet
Machine Learning Project
11 pages
Desktop Voice Assistant Research Paper
No ratings yet
Desktop Voice Assistant Research Paper
3 pages
Python Voice Assistant Survey
No ratings yet
Python Voice Assistant Survey
5 pages
B.E Cse Batchno 52
No ratings yet
B.E Cse Batchno 52
77 pages
Deskmate Assistant Project
No ratings yet
Deskmate Assistant Project
8 pages
B.E Cse Batchno 52
No ratings yet
B.E Cse Batchno 52
77 pages
Report Further Page
No ratings yet
Report Further Page
28 pages
Desktop'S Virtual Assistant Using Python: N Umapathi, G Karthick, N Venkateswaran, R Jegadeesan, Dava Srinivas
No ratings yet
Desktop'S Virtual Assistant Using Python: N Umapathi, G Karthick, N Venkateswaran, R Jegadeesan, Dava Srinivas
10 pages
Voice Assistants for Accessibility
No ratings yet
Voice Assistants for Accessibility
5 pages
Synopsis AI
No ratings yet
Synopsis AI
21 pages
Semifinal
No ratings yet
Semifinal
19 pages
2 BP
No ratings yet
2 BP
10 pages
Report
No ratings yet
Report
53 pages
Jarvis: Virtual Voice Command Desktop Assistant
No ratings yet
Jarvis: Virtual Voice Command Desktop Assistant
4 pages
AI-based Desktop Voice Assistant
No ratings yet
AI-based Desktop Voice Assistant
4 pages
Chapter 2: THE PROJECT
No ratings yet
Chapter 2: THE PROJECT
25 pages
Virtual Assistant For Desktop
No ratings yet
Virtual Assistant For Desktop
4 pages
Synopsis
No ratings yet
Synopsis
6 pages
AI Voice Assistant Development
No ratings yet
AI Voice Assistant Development
5 pages
Personal Voice Assistant
100% (1)
Personal Voice Assistant
118 pages
Synopsis
No ratings yet
Synopsis
6 pages
Paper Published
No ratings yet
Paper Published
3 pages
Building Your Personalized Ai Assistant Phase2
No ratings yet
Building Your Personalized Ai Assistant Phase2
7 pages
Synopsis - Voice Assistant Using Python
No ratings yet
Synopsis - Voice Assistant Using Python
5 pages
AI Project Report
No ratings yet
AI Project Report
7 pages
Desktop Assistant Project Report
No ratings yet
Desktop Assistant Project Report
71 pages
Minor Project Report
No ratings yet
Minor Project Report
5 pages
Personal AI Desktop Assistant
No ratings yet
Personal AI Desktop Assistant
8 pages
Developing-A-Desktop-Voice Project
No ratings yet
Developing-A-Desktop-Voice Project
5 pages
Pvaresearch
No ratings yet
Pvaresearch
2 pages
Virtual Voice Assistant My Proj123
No ratings yet
Virtual Voice Assistant My Proj123
29 pages
Voice Assistent Using Python Synopsis
No ratings yet
Voice Assistent Using Python Synopsis
10 pages
Voice Assistant Mini-Project Report
No ratings yet
Voice Assistant Mini-Project Report
24 pages
Paper Format
No ratings yet
Paper Format
6 pages
Voice Assistant Project Report
No ratings yet
Voice Assistant Project Report
41 pages
Research On Content Summarizer
No ratings yet
Research On Content Summarizer
4 pages
Virtual Assistant Project Guide
No ratings yet
Virtual Assistant Project Guide
5 pages
AI Voice Commands Assistant ZEUS
No ratings yet
AI Voice Commands Assistant ZEUS
8 pages
VIRTUAL
No ratings yet
VIRTUAL
67 pages
Shashank PosterPresentations
No ratings yet
Shashank PosterPresentations
1 page
Minor Degree Major Document
No ratings yet
Minor Degree Major Document
35 pages
Final PDF 9TH Sem
No ratings yet
Final PDF 9TH Sem
43 pages
Desktop Voice Assistant
No ratings yet
Desktop Voice Assistant
3 pages
My Voice Assistant Using Python
No ratings yet
My Voice Assistant Using Python
6 pages
Virtual Assistant Report
No ratings yet
Virtual Assistant Report
17 pages
Ai Project (Voice Assisstant)
No ratings yet
Ai Project (Voice Assisstant)
18 pages
IRJET V8I4696 With Cover Page v2
No ratings yet
IRJET V8I4696 With Cover Page v2
4 pages
Project Report On Virtual Assistant Subm
No ratings yet
Project Report On Virtual Assistant Subm
30 pages
MCA2282
No ratings yet
MCA2282
33 pages
Galgotias Selected List
No ratings yet
Galgotias Selected List
14 pages
TechSnitch Confirmation 28th Jan
No ratings yet
TechSnitch Confirmation 28th Jan
2 pages
Interview Attendance List
No ratings yet
Interview Attendance List
2 pages
ADA MCA Assignment1
No ratings yet
ADA MCA Assignment1
1 page
Deepak Oe
No ratings yet
Deepak Oe
40 pages
Event Loop in Dart
No ratings yet
Event Loop in Dart
8 pages
XIIComp SC H Y 463
No ratings yet
XIIComp SC H Y 463
5 pages
Xii Computer Science
No ratings yet
Xii Computer Science
7 pages
LO-COCKPIT Extraction For Begginers
No ratings yet
LO-COCKPIT Extraction For Begginers
17 pages
Hadoop Basics for Data Science Students
No ratings yet
Hadoop Basics for Data Science Students
22 pages
Advanced Mapreduce
No ratings yet
Advanced Mapreduce
37 pages
VNode Manual RestApi Client v1.8
No ratings yet
VNode Manual RestApi Client v1.8
31 pages
JSON.NET Integration for Developers
No ratings yet
JSON.NET Integration for Developers
9 pages
Matlab Vision HDL Toolbox Documentation Download
100% (1)
Matlab Vision HDL Toolbox Documentation Download
90 pages
Facebook Thrift
100% (2)
Facebook Thrift
27 pages
AUTOSAR SWS COMBasedTransformer
No ratings yet
AUTOSAR SWS COMBasedTransformer
58 pages
Drone Kit-Python For Autonomous Quadcopter Navigat
No ratings yet
Drone Kit-Python For Autonomous Quadcopter Navigat
8 pages
2349b - Programming With The MS .NET Framework MS VCSharp PDF
No ratings yet
2349b - Programming With The MS .NET Framework MS VCSharp PDF
938 pages
CX - 0021 Batch UseCaseTraceability V 1.0.1
No ratings yet
CX - 0021 Batch UseCaseTraceability V 1.0.1
30 pages
Conlib Filehandlingtest1answerkey 20250305094905-1
No ratings yet
Conlib Filehandlingtest1answerkey 20250305094905-1
72 pages
SAP DMT Integration Guide
No ratings yet
SAP DMT Integration Guide
18 pages
BTSHOL01 Building Your First BizTalk Solution
No ratings yet
BTSHOL01 Building Your First BizTalk Solution
24 pages
SAP IDoc and ALE Configuration Guide
No ratings yet
SAP IDoc and ALE Configuration Guide
9 pages
Session 1.7 Access Modifiers
No ratings yet
Session 1.7 Access Modifiers
44 pages
SEA Vision Full Stack Solutions Provider
No ratings yet
SEA Vision Full Stack Solutions Provider
72 pages
VS2005 JSharp En-Us PDF
No ratings yet
VS2005 JSharp En-Us PDF
6,421 pages
Serial Numbers Functionality and Configuration in SAP PP
No ratings yet
Serial Numbers Functionality and Configuration in SAP PP
16 pages
OOPs Interview Questions and Answers
No ratings yet
OOPs Interview Questions and Answers
19 pages
AnswerSheet Part2
No ratings yet
AnswerSheet Part2
101 pages
Advanced Java Unit 3 Digital Notes
100% (1)
Advanced Java Unit 3 Digital Notes
67 pages
ATTP 2.01-Devlpr Guide WH Toolbox-FINAL
No ratings yet
ATTP 2.01-Devlpr Guide WH Toolbox-FINAL
16 pages
TensorFlow Lite Micro Embedded Machine L
No ratings yet
TensorFlow Lite Micro Embedded Machine L
13 pages
Chapter 2-File Handling in Python
No ratings yet
Chapter 2-File Handling in Python
13 pages
Student Management System Project in C
No ratings yet
Student Management System Project in C
8 pages
Main Python
No ratings yet
Main Python
34 pages

Open Elective Project

Uploaded by

Open Elective Project

Uploaded by

OPEN ELECTIVE PROJECT

DESKTOP ASSISTANT USING PYTHON

Under the guidance of

DR. SHALINI BHASKAR BAJAJ

Amity School of Engineering and Technology

I, Ishan Singh, Enrolment No. A50504820008, Mihir Chaudhery Enrolment

D.R SHALNI BHASKAR BAJAJ

1.1 Project Introduction.............................................................................06

1.2 Project Description...............................................................................06

6. Future scope of project…………………………………………………….11

1.1 Project Introduction

1.2 Project Description

2.2 Literature Survey

3.1 Technology Used

Python supports multiple programming paradigms, including procedural, object-oriented, and

3.2 HARDWARE / SOFTWARE REQUIREMENT

SPECIFICATIONS: SOFTWARE REQUIREMENTS:

Name of component Specification

Operating System window 11, window 10

Language Python 3.11.0

Name of component Specification

4.1.2 API Call

4.1.3 Python Text-To-Speech using Pyttsx3.

4.1.5 Web browser

4.1.8 Winshell and Os

urlopen(url, data=None, timeout=None, *, cafile=None, capath=None, cadefault=False,

Request(url, data=None, headers={}, origin_req_host=None, unverifiable=False,

urlencode(query, doseq=False, safe='', encoding=None, errors=None, quote_via=quote_plus):

response.decode(encoding='utf-8', errors='strict'): This method decodes the byte string content

response.geturl(): This method returns the URL of the response.

Overall, win32com.client.Dispatch is a powerful module that facilitates COM automation in Python.

pycaw is a Python library typically used in Programming Style, Reactive

Fig -1 – Implementation of Project

1. Enhanced Natural Language Understanding:

2. Personalization and Context Awareness:

4.Integration with IoT Devices:

5.Advanced Task Automation:

6.Voice and Emotion Recognition:

7.Deeper Integration with Applications:

8.Privacy and Security:

10.Collaboration and Integration with Humans:

You might also like