HEART ATTACK RISK PREDICTION USING MACHINE LEARNING
A PROJECT REPORT
Submitted by
April 2024
VELAMMAL ENGINEERING COLLEGE
CHENNAI -66
BONAFIDE CERTIFICATE
Certified that this project report “Heart Attack Risk Prediction using Machine Learning” is the
bonafide work of “PRAMODH KUMAR P (113219031098), PUGAZHENDI B (113219031168),
SANTHOSH KUMAR S (113219031168)” who carried out the project work under my supervision.
CERTIFICATE OF EVALUATION
Sl. No. | Name of the students who have done the project | Title of the Project | Name of supervisor with designation

This project report, submitted by the above students in partial fulfillment of the requirements for the award
of the Bachelor of Engineering degree of Anna University, was evaluated and confirmed to be a report of
the work done by the above students and was then assessed.
Internal Examiner
External Examiner
ABSTRACT
Heart disease remains a leading cause of mortality worldwide, underscoring the need for effective
predictive tools to mitigate its impact. Machine learning (ML) techniques offer promising avenues
for predicting heart attacks by leveraging diverse patient data. This abstract presents the
development of a predictive model for heart attacks implemented through a user-friendly Streamlit
application. The proposed system integrates ML algorithms with an intuitive user interface, allowing
healthcare professionals and individuals to assess their risk of experiencing a heart attack.
Leveraging datasets comprising various demographic, clinical, and lifestyle factors, the model
employs feature selection, preprocessing techniques, and ensemble learning to optimize predictive
accuracy. The Streamlit application provides a seamless platform for users to input their personal
data, such as age, gender, blood pressure, cholesterol levels, and lifestyle habits. Through
interactive visualizations and user-friendly controls, individuals gain insights into their
cardiovascular health and potential risk factors. Furthermore, the application's backend utilizes ML
algorithms, including logistic regression, decision trees, and support vector machines, to analyze
input data and generate personalized risk assessments. By harnessing the power of ML, the model
continuously learns from new data, enhancing its predictive capabilities and adaptability over time.
ACKNOWLEDGEMENT
I wish to acknowledge with thanks the significant contribution given by the management of
our college: our Chairman, Dr. M.V. Muthuramalingam, and our Chief Executive Officer, Thiru.
M.V.M. Velmurugan, for their extensive support.
I would like to thank Dr. S. Satish Kumar, Principal of Velammal Engineering College,
for giving me this opportunity to do this project.
I express my thanks to our Project Coordinators, Dr. P. S. Smitha, Dr. P. Pritto Paul, and
Dr. S. Rajalakshmi, Department of Computer Science and Engineering for their invaluable
guidance in the shaping of this project.
I am grateful to all the staff members of the Department of Computer Science and
Engineering for providing the necessary facilities for carrying out the project. I would especially
like to thank my parents for providing me with the unique opportunity to work and for their
encouragement and support at all levels. Finally, my heartfelt thanks to The Almighty for guiding
me throughout my life.
TABLE OF CONTENTS

ABSTRACT
LIST OF FIGURES
1 INTRODUCTION
2 LITERATURE SURVEY
2.1 INTRODUCTION
2.3 APPLYING CODEBERT FOR AUTOMATED PROGRAM REPAIR OF JAVA SIMPLE BUGS
3 SYSTEM ANALYSIS
4 SYSTEM SPECIFICATION
4.1.3 PYTHON
4.1.5 STREAMLIT
5 SYSTEM DESIGN
6 SYSTEM IMPLEMENTATION
6.1 MODULES
7 TESTING
7.1 INTRODUCTION
7.2 TESTING OBJECTIVES
8.1 CONCLUSION
APPENDIX II - SNAPSHOTS
REFERENCES

LIST OF FIGURES

Fig 1.1 Machine Learning diagram
Fig 5.2.1 Use Case diagram
Fig 5.2.2 Class diagram
Fig 5.2.3 Sequence diagram
Fig 6.2.2 Machine Learning Language Model
Fig 6.2.3 Data analysis
Fig 6.2.4 Natural Language Processing
Fig 9.3 Person does not have a heart attack
LIST OF ABBREVIATIONS
AI Artificial Intelligence
ML Machine Learning
PL Programming Language
NL Natural Language
CHAPTER 1
INTRODUCTION
1.1 PURPOSE OF THE PROJECT
The purpose of this project extends beyond mere prediction; it encompasses several key objectives:
Early Detection: One of the primary purposes is to detect the risk factors associated with heart attacks at
an early stage. By leveraging machine learning algorithms, we aim to identify subtle patterns and
indicators that might go unnoticed through traditional diagnostic methods.
Risk Stratification: Not all individuals have the same risk profile for heart attacks. Through this project,
we seek to stratify individuals based on their risk levels, enabling healthcare professionals to allocate
resources efficiently and tailor interventions according to the specific needs of each group.
Personalized Medicine: Every individual possesses a unique set of characteristics and risk factors. By
incorporating personalized data such as medical history, lifestyle factors, and genetic predispositions, our
goal is to develop models that offer personalized risk assessments, empowering individuals to make
informed decisions about their health.
Prevention and Intervention: Ultimately, the overarching purpose is to prevent heart attacks and mitigate
their impact. By accurately predicting the likelihood of a heart attack, healthcare providers can intervene
proactively, implementing preventive measures such as lifestyle modifications, medication, and targeted
interventions to reduce the risk and severity of cardiovascular events.
Public Health Impact: Beyond individual-level interventions, this project aims to contribute to broader
public health initiatives. By identifying population-level trends and risk factors, policymakers and
healthcare authorities can implement preventive strategies, allocate resources efficiently, and design
targeted interventions to address the root causes of CVDs at a population level.
1.2 SCOPE OF THE PROJECT
Machine learning (ML) has emerged as a transformative technology with profound implications across
various domains, revolutionizing how we process data, derive insights, and make decisions. Rooted in the
field of artificial intelligence (AI), machine learning algorithms enable computers to learn from data,
identify patterns, and make predictions or decisions without being explicitly programmed. This
introduction aims to provide an overview of machine learning, exploring its fundamental concepts,
methodologies, applications, and societal impact.
At its core, machine learning is concerned with the development of algorithms that improve their
performance over time as they are exposed to more data. This learning process can be broadly categorized
into three main types:
Fig 1.1: Machine Learning diagram
Supervised Learning: In supervised learning, the algorithm is trained on labeled data, where each input
is associated with a corresponding output. The goal is to learn a mapping from inputs to outputs, enabling
the algorithm to make predictions on unseen data. Common applications include classification (e.g., spam
detection, image recognition) and regression (e.g., predicting house prices, forecasting sales).
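As a minimal illustration of supervised learning (a sketch, not code from this project), the snippet below fits a logistic regression classifier on a small synthetic dataset with scikit-learn; the interpretation of the features in the comments is an assumption chosen to echo the attributes used later in this report.

import numpy as np
from sklearn.linear_model import LogisticRegression
from sklearn.metrics import accuracy_score
from sklearn.model_selection import train_test_split

# Synthetic data for illustration only; columns stand in for scaled clinical
# attributes such as age, blood pressure, and cholesterol (an assumption).
rng = np.random.default_rng(42)
X = rng.normal(size=(200, 3))
y = (X[:, 0] + 0.5 * X[:, 1] > 0).astype(int)    # synthetic "at risk" label

X_train, X_test, y_train, y_test = train_test_split(X, y, test_size=0.2, random_state=0)

model = LogisticRegression()
model.fit(X_train, y_train)                      # learn a mapping from inputs to outputs
print("Test accuracy:", accuracy_score(y_test, model.predict(X_test)))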
Unsupervised Learning: Unsupervised learning involves training algorithms on
unlabeled data, where the objective is to uncover hidden patterns or structures within the data.
Clustering algorithms, such as k-means and hierarchical clustering, group similar data points
together based on their features, while dimensionality reduction techniques like principal
component analysis (PCA) extract meaningful representations of the data.
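As a sketch of the two unsupervised techniques named above, the snippet below clusters unlabeled synthetic data with k-means and then compresses it with PCA; the data is invented purely for illustration.

import numpy as np
from sklearn.cluster import KMeans
from sklearn.decomposition import PCA

# Two hidden groups of points with no labels (synthetic, for illustration).
rng = np.random.default_rng(0)
X = np.vstack([rng.normal(loc=0.0, size=(100, 5)),
               rng.normal(loc=3.0, size=(100, 5))])

kmeans = KMeans(n_clusters=2, n_init=10, random_state=0).fit(X)
print("Cluster sizes:", np.bincount(kmeans.labels_))     # groups found without labels

pca = PCA(n_components=2)                                # reduce 5 features to 2 components
X_2d = pca.fit_transform(X)
print("Explained variance ratio:", pca.explained_variance_ratio_)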
Healthcare: In healthcare, machine learning supports applications such as disease risk prediction, medical image analysis, and personalized treatment planning, helping clinicians detect conditions earlier and tailor interventions to individual patients.
Finance:
The finance industry leverages machine learning for fraud detection, risk assessment, algorithmic
trading, and customer service optimization. Fraud detection algorithms analyze transaction data
in real-time, flagging suspicious activities and preventing financial losses. Risk assessment
models utilize historical data to predict market trends, assess creditworthiness, and optimize
investment portfolios. Machine learning-driven chatbots provide personalized financial advice
and support, enhancing customer experience and engagement.
Manufacturing:
Machine learning plays a pivotal role in enhancing efficiency, quality control, and predictive
maintenance in manufacturing processes. Predictive maintenance algorithms analyze sensor data
from machinery to predict equipment failures before they occur, reducing downtime and
maintenance costs. Quality control systems employ machine learning techniques to detect defects
in products, ensuring compliance with quality standards. Additionally, supply chain optimization
models optimize inventory management, production scheduling, and logistics, improving overall
operational efficiency.
Big Data is a term used to describe collections of data that are huge in size and yet growing
exponentially with time. In short, such data is so large and complex that none of the traditional
data management tools can store or process it efficiently.
Big Data could be found in three forms:
1. Structured
2. Unstructured
3. Semi-structured
Structured: Any data that can be stored, accessed, and processed in the form of a fixed
format is termed 'structured' data. Over time, computer science has developed increasingly
successful techniques for working with such data (where the format is well known in advance)
and for deriving value from it. However, issues arise nowadays when the size of such data grows
to a huge extent, with typical sizes in the range of multiple zettabytes.
Unstructured: Any data with an unknown form or structure is classified as unstructured data,
for example text files, images, and videos.
Semi-structured: Semi-structured data can contain both forms of data. It appears structured in
form but is not defined with, for example, a table definition in a relational DBMS. An example of
semi-structured data is data represented in an XML file.
(i) Volume – The name Big Data itself is related to its enormous size. The size of data plays a very
crucial role in determining the value that can be derived from it. Also, whether particular data can
actually be considered Big Data or not depends upon the volume of data. Hence, 'Volume' is one
characteristic that needs to be considered while dealing with Big Data.
(ii) Variety – The next aspect of Big Data is its variety. Variety refers to heterogeneous sources
and the nature of data, both structured and unstructured. In earlier days, spreadsheets and databases
were the only sources of data considered by most applications.
(iii) Velocity – The term 'velocity' refers to the speed of the generation of data. How fast the data
is generated and processed to meet the demands, determines the real potential of the data. Big Data
Velocity deals with the speed at which data flows in from sources like business processes,
application logs, networks, social media sites, sensors, Mobile devices, etc. The flow of data is
massive and continuous.
(iv) Variability – This refers to the inconsistency which can be shown by the data at times.

Benefits of Big Data processing: The ability to process Big Data brings multiple benefits, such as:
● Businesses can utilize outside intelligence while taking decisions. Access to social data from
search engines and sites like Facebook and Twitter is enabling organizations to fine-tune their
business strategies.
● Improved customer service. Traditional customer feedback systems are being replaced by new
systems designed with Big Data technologies, in which Big Data and natural language processing
technologies are used to read and evaluate consumer responses.
● Early identification of risk to the product or services, if any.
● Better operational efficiency. Big Data technologies can be used to create a staging area or
landing zone for new data before identifying what data should be moved to the data warehouse.
In addition, such integration of Big Data technologies and the data warehouse helps an
organization offload infrequently accessed data.
CHAPTER 2
LITERATURE SURVEY
2.1 INTRODUCTION:
The development of software applications is a complex process that involves writing and testing
code, debugging errors, and ensuring that the final product meets the desired specifications. In
recent years, there has been growing interest in the use of machine learning (ML) techniques,
particularly natural language processing (NLP), to support developers during the coding process.
This chapter reviews existing research studies and articles. The survey provides an overview of the
current state of research in this field, including the strengths and limitations of existing approaches,
and identifies areas for future research and development.
Abstract:
Modern society relies on complex software applications that consist of millions of lines
written in many programming languages (PLs) by many teams of developers. PLs are difficult to
read and understand quickly so the developers must also document their programs to make them
more maintainable. Mistakes made during the coding phase lead to software bugs that can cost time
and money for the software creators and users. A popular technology used by software developers
is a “linter” which flags syntactic errors in code. Auto-formatters will add or remove whitespace
and “newline” characters to code to improve readability. Statement auto-complete tools can
suggest tokens that programmers might write next to improve their productivity. While these
traditional tools can be useful for programmers, most of them can’t help a developer with complex
tasks such as writing understandable code documentation or implementing algorithms.
2.3 Applying CodeBERT for Automated Program Repair of Java Simple Bugs
Authors: Ehsan Mashhadi, Hadi Hemmati
Abstract:
Software debugging and program repair are among the most time-consuming and labor-
intensive tasks in software engineering that would benefit a lot from automation. In this paper, the
authors propose a novel automated program repair approach based on CodeBERT, which is a
transformer-based neural architecture pre-trained on a large corpus of source code. The model is
fine-tuned on the ManySStuBs4J small and large datasets to automatically generate the fix codes.
The results show that their technique accurately predicts the fixed codes implemented by the
developers in 19-72% of the cases, depending on the type of datasets, in less than a second per
bug.
2.4 IntelliCode Compose: Code Generation Using Transformer
Abstract:
In software development through integrated development environments (IDEs), code completion
is one of the most widely used features. Nevertheless, the majority of integrated development
environments only support completion of methods and APIs, or arguments. In this paper, the
authors introduce IntelliCode Compose – a general purpose multilingual code completion tool
which is capable of predicting sequences of code tokens of arbitrary types, generating up to entire
lines of syntactically correct code. It leverages a state-of-the-art generative transformer model
trained on 1.2 billion lines of source code in Python, C#, JavaScript and TypeScript programming
languages. IntelliCode Compose is deployed as a cloud-based web service.
2.5 CodeBERT: A Pre-Trained Model for Programming and Natural Languages
Authors: Zhangyin Feng, Daya Guo, Duyu Tang, Nan Duan, Xiaocheng Feng, Ming Gong, Linjun
Shou, Bing Qin, Ting Liu, Daxin Jiang, Ming Zhou
Abstract:
The authors present CodeBERT, a bimodal pre-trained model for programming language
(PL) and natural language (NL). CodeBERT learns general purpose representations that support
downstream NL-PL applications such as natural language code search, code documentation
generation, etc. CodeBERT was developed with a Transformer-based neural architecture and trained
with a hybrid objective function that incorporates the pre-training task of replaced token
detection, which is to detect plausible alternatives sampled from generators. This enables the model
to utilize both “bimodal” data of NL-PL pairs and “unimodal” data, where the former provides input
tokens for model training while the latter helps to learn better generators. The authors evaluate
CodeBERT on two NL-PL applications by fine-tuning model parameters. Results show that CodeBERT
achieves state-of-the-art performance on both natural language code search and code
documentation generation.
CHAPTER 3
SYSTEM ANALYSIS
Data Sources:
Existing systems leverage diverse data sources, including electronic health records (EHRs), medical
imaging, laboratory tests, genetic information, lifestyle factors, and demographic data. Clinical datasets
from hospitals, healthcare institutions, research databases, and public health repositories serve as valuable
sources for training and validating predictive models.
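As a sketch of how such a clinical dataset could be loaded and split for training and validation, the snippet below reads a CSV file with pandas; the file name heart.csv and the target column name are assumptions modeled on the public Kaggle heart-disease dataset rather than details confirmed by this report.

import pandas as pd
from sklearn.model_selection import train_test_split

# File name and column names are assumptions; adjust to the actual dataset.
df = pd.read_csv("heart.csv")
X = df.drop(columns=["target"])        # demographic, clinical, and lifestyle features
y = df["target"]                       # 1 = heart disease present, 0 = absent

# Stratified split so both sets keep the same class balance.
X_train, X_val, y_train, y_val = train_test_split(
    X, y, test_size=0.2, stratify=y, random_state=42)
print(X_train.shape, X_val.shape)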
CHAPTER 4
SYSTEM SPECIFICATION
4.1 SOFTWARE SPECIFICATION
4.1.1 VISUAL STUDIO CODE
Visual Studio Code (VS Code) is a free source-code editor developed by Microsoft. It is
designed to be lightweight, fast, and customizable, making it a popular choice for developers
across a variety of programming languages and platforms. One of the key features of VS Code is
its support for extensions, which allows developers to add functionality and customize the editor
to suit their needs. There are thousands of extensions available, ranging from language-specific
syntax highlighting and autocomplete to tools for debugging, testing, and version control. In
addition to its extensibility, VS Code also includes a range of built-in features to support efficient
coding, such as IntelliSense, which provides context-aware code completion and suggestion, and
a built-in terminal for running commands and scripts. VS Code also includes support for debugging
and profiling, with integrated tools for debugging Node.js, Python, and other languages, as well as
the ability to attach to remote processes for debugging and profiling in a distributed environment.
VS Code is a powerful and versatile code editor that provides a range of features and tools to
support efficient and effective coding. Its popularity among developers is a testament to its
usefulness and flexibility.
4.1.2 ANACONDA DISTRIBUTION
Anaconda Distribution stands as one of the most comprehensive and widely used platforms for data
science and machine learning. With its extensive collection of tools, libraries, and packages, Anaconda
simplifies the process of setting up and managing environments for data analysis, statistical modeling, and
AI development. Let's delve into the details of Anaconda Distribution, its components, features, and
benefits.
Components of Anaconda Distribution:
Conda Package Manager:
At the heart of Anaconda Distribution lies Conda, a powerful package manager and environment
manager.Conda facilitates package installation, dependency management, and environment creation
across different operating systems.It enables users to seamlessly install, update, and manage packages and
libraries without worrying about compatibility issues or conflicts.
Python Interpreter:
Anaconda Distribution comes bundled with the Python programming language, providing users with a
robust and versatile platform for data analysis, scripting, and application development. Python's rich
ecosystem of libraries, including NumPy, Pandas, SciPy, Matplotlib, and scikit-learn, makes it an ideal
choice for data science and machine learning projects.
Jupyter Notebook:
Anaconda includes Jupyter Notebook, an interactive computing environment that allows users to create
and share documents containing live code, visualizations, and explanatory text. Jupyter Notebooks support
multiple programming languages, including Python, R, and Julia, making it a versatile tool for data
exploration, prototyping, and collaborative research.
Spyder IDE:
Anaconda Distribution features Spyder, an Integrated Development Environment (IDE) tailored for
scientific computing and data analysis. Spyder provides a user-friendly interface with features such as
code editing, debugging, variable exploration, and integrated IPython console, enhancing productivity for
data scientists and researchers.
Libraries and Packages:
Anaconda Distribution comes pre-installed with a vast array of essential libraries and packages for
data science, machine learning, and scientific computing. These include popular libraries like NumPy,
Pandas, SciPy, Matplotlib, scikit-learn, TensorFlow, PyTorch, Keras, and many others.
4.1.3 PYTHON
Python is a versatile and powerful programming language renowned for its simplicity, readability, and
extensive ecosystem of libraries and frameworks. Developed by Guido van Rossum in the late 1980s,
Python has gained widespread adoption across various domains, including web development, data
science, machine learning, artificial intelligence, and scientific computing. Key features of Python include
its clean and intuitive syntax, dynamic typing, automatic memory management, and strong support for
object-oriented, functional, and procedural programming paradigms. Python's extensive standard library
provides modules for tasks ranging from file I/O and networking to web development and GUI
programming. Moreover, Python's vibrant community and open-source ethos have led to the creation of
numerous third-party libraries and frameworks, such as NumPy, Pandas, Matplotlib, TensorFlow, Django,
Flask, and scikit-learn, which further enhance its capabilities and make it a preferred choice for
developers worldwide.
4.1.4 KAGGLE DATASETS
Kaggle datasets offer a treasure trove of valuable data resources for data scientists, machine learning
practitioners, and researchers worldwide. As one of the largest platforms for data science competitions
and collaborative projects, Kaggle hosts a diverse collection of datasets spanning multiple domains,
including healthcare, finance, social sciences, biology, and more. Kaggle datasets range from small,
curated datasets suitable for learning and experimentation to large-scale, real-world datasets sourced from
industry partners, government agencies, and research institutions. These datasets cover a wide range of
topics, such as image classification, natural language processing, time series analysis, predictive
modeling, and anomaly detection. One of the key strengths of Kaggle datasets is their accessibility and
usability. Users can easily explore, download, and analyze datasets directly from the Kaggle platform,
leveraging tools like Jupyter notebooks and Kaggle Kernels for data exploration, visualization, and
modeling. Additionally, Kaggle fosters a collaborative community where users can share insights, discuss
methodologies, and collaborate on projects, further enriching the learning and discovery experience.
4.1.5 STREAMLIT
Streamlit is a powerful Python library that simplifies the process of building interactive web applications
for data science and machine learning projects. With its intuitive and user-friendly interface, Streamlit
allows developers to create data-driven applications with minimal code, enabling rapid prototyping and
deployment. Streamlit seamlessly integrates with popular data science libraries such as Pandas,
Matplotlib, and scikit-learn, enabling users to create dynamic visualizations, interactive dashboards, and
machine learning models with ease. Whether you're a data scientist, researcher, or developer, Streamlit
empowers you to share insights, engage stakeholders, and deploy your projects seamlessly, making it an
invaluable tool for the data science community.
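To show how little code an interactive Streamlit page needs, here is a minimal sketch; it is not the project's application (which appears in Appendix 1), and the placeholder risk rule is an assumption used only for illustration.

# minimal_app.py  (run with: streamlit run minimal_app.py)
import streamlit as st

st.title("Heart Attack Risk Prediction")
age = st.number_input("Age", min_value=1, max_value=120, value=45)
chol = st.number_input("Cholesterol (mg/dl)", min_value=100, max_value=600, value=200)

if st.button("Check risk"):
    # Placeholder rule for illustration only; the real application calls a trained ML model.
    st.success("Higher risk" if age > 55 and chol > 240 else "Lower risk")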
A machine learning language model is a type of machine learning algorithm that is specifically
designed to process and analyze natural language data, such as text or speech. These models are
typically trained on large datasets of natural language data, and use statistical and probabilistic
methods to learn patterns and relationships in the data.
One common type of machine learning language model is the neural language model, which uses
artificial neural networks to process and analyze language data. These models are typically trained
on large amounts of text data, such as books, articles, or web pages, and are designed to predict
the probability of a particular word or sequence of words given the context of the surrounding text.
Neural language models are commonly used for a variety of natural language processing tasks,
such as language translation, sentiment analysis, and speech recognition. They are also
increasingly being used for coding assistance and support, with developers using ML language
models to provide suggestions, recommendations, and other forms of assistance during the coding
process.
4.2 HARDWARE SPECIFICATION
Hardware environment refers to the physical components that make up a computer system,
including the processor, memory, storage, input/output devices, and other components. The
hardware environment can have a significant impact on the performance and capabilities of a
computer system and is an important consideration when choosing a system for a specific
application.
CHAPTER 5
SYSTEM DESIGN
User interface: This is the interface that developers use to interact with the coding companion. It
typically includes features such as code suggestions, error detection and correction, and code
completion.
The back-end component is responsible for processing the code being written by the developer
and providing suggestions and feedback. It uses ML algorithms and techniques to understand the
context of the code and provide relevant suggestions.
The ML model is the core component of the coding companion and is responsible for
understanding the context of the code being written by the developer. This component is trained
on a large corpus of programming language code and is optimized for specific programming
languages and frameworks.
An API layer can be used to facilitate communication between the front-end and back-end
components of the coding companion. This layer can be implemented using a variety of
technologies, such as REST APIs or WebSockets.
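A hypothetical sketch of such an API layer as a REST endpoint in Flask follows; the route name, request fields, and stubbed suggestion logic are all assumptions, not the project's actual interface.

from flask import Flask, jsonify, request

app = Flask(__name__)

@app.route("/suggestions", methods=["POST"])
def suggestions():
    # The front end posts the code being edited; the field name is an assumption.
    code = request.get_json().get("code", "")
    # A real back end would pass `code` to the ML model; here it is stubbed.
    return jsonify({"suggestions": [f"received {len(code)} characters of code"]})

if __name__ == "__main__":
    app.run(port=5000)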
5.2 UML DIAGRAMS
CodingCompanion: The main class that encapsulates the system and holds references to its
components.
Input Processor: A class responsible for processing the user input, performing code analysis, and
extracting relevant information.
Output Processor: A class responsible for processing the output generated by the ML model,
filtering, and formatting the suggestions for display.
MLModel: A class responsible for training and making predictions using the ML algorithm.
Each class has its own set of methods that are specific to its responsibilities. The Coding
Companion class has methods to execute the system, retrieve user input, display suggestions, train
the model, and update user preferences. The Input Processor class has a method for performing
code analysis, and the Output Processor class has a method for filtering the suggestions. The ML
Model class has methods for training the model using training data and making predictions using
the trained model.
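A skeleton sketch of how these classes could be laid out in Python follows; the method names mirror the responsibilities listed above but are assumptions rather than the project's actual code.

class InputProcessor:
    def analyze_code(self, code):
        # Perform code analysis and extract relevant information.
        return {"language": "python", "tokens": code.split()}

class MLModel:
    def train(self, training_data):
        # Fit the model on (input, output) training examples.
        self.training_data = training_data

    def predict(self, analysis):
        # Return raw suggestions for the analyzed input (stubbed here).
        return ["suggestion A", "suggestion B"]

class OutputProcessor:
    def filter_suggestions(self, suggestions):
        # Filter and format suggestions for display.
        return suggestions[:1]

class CodingCompanion:
    # Main class that holds references to the other components.
    def __init__(self):
        self.input_processor = InputProcessor()
        self.ml_model = MLModel()
        self.output_processor = OutputProcessor()

    def suggest(self, code):
        analysis = self.input_processor.analyze_code(code)
        return self.output_processor.filter_suggestions(self.ml_model.predict(analysis))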
5.2.1 USE CASE DIAGRAMS:
Additionally, we have identified four main user actions that interact with the system:
1. Enter code: The user enters code to receive suggestions.
2. View suggestions: The user views the suggestions generated by the system.
3. Train model: The user provides training data to train the ML model.
4. Update preferences: The user updates their preferences to personalize the suggestions.
Finally, we have identified four main system actions that are performed by the system to support
the use cases:
1. Process user input: The system processes the user input to generate suggestions.
2. Retrieve suggestions: The system retrieves relevant suggestions based on the user input.
3. Train ML model: The system uses the training data provided by the user to train the ML
model.
4. Update preferences: The system uses the user preferences to personalize the suggestions
generated for the user.
Fig 5.2.1: Use Case diagram
1. Code: This class represents the code entered by the user. It has a code string, a language,
and a list of code suggestions generated by the system.
2. CodeSuggestion: This class represents a code suggestion generated by the system. It has
a suggestion string and a confidence score.
3. MLModel: This class represents the ML model used to generate code suggestions. It has
a list of training data and methods to train the model and generate code suggestions.
4. TrainingData: This class represents the training data used to train the ML model. It has
an input and an output.
Fig 5.2.2: Class diagram
Fig 5.2.3: Sequence diagram
The Coding Companion updates its internal state with the updated Code object and displays the
updated code to the user.
6. The ML Model updates its internal state with the new information and returns the updated
output object. The machine learning model updates its internal state with the updated data object
and the output is displayed to the user.
5.2.6 DATA FLOW DIAGRAM:
5.2.6.2 LEVEL 1 DATA FLOW DIAGRAM
In this level 1 DFD, we have added more detail to each component of the coding companion
system:
User Interface: The user interface component captures user input and displays suggestions and
feedback.
Input Processor: The input processor component receives user input, preprocesses it, and
performs code analysis to extract relevant information.
Code Analysis: This process analyzes the code to extract relevant information that can be used to
improve the suggestions and feedback generated by the ML model.
ML Model: This component performs the prediction task based on the training data.
Model Training: This process trains the ML model using relevant data.
Output Processor: The output processor component receives the suggestions and feedback
generated by the ML model, filters, prioritizes and formats them for display.
Data Store: This component stores data related to the coding companion system, such as user
preferences, code history, and ML model parameters.
CHAPTER 6
SYSTEM IMPLEMENTATION
6.1 MODULES
● User Interface
● Machine Learning Language Model
● Data Analysis
● Natural Language Processing
The user interface (UI) in a coding companion using an ML language model is necessary to provide
a seamless and user-friendly experience for the user. A well-designed UI can make it easier for the
user to interact with the application and make use of its features. The UI can be designed to guide
the user through the various functions and tools of the application, allowing users to easily move
between sections of the application and access the features they need. This can improve the user
experience and increase the efficiency of the coding process.
Machine learning (ML) plays a crucial role in a coding companion using ML language model. The
ML model can analyze the user's code and suggest improvements or alternative solutions. This can
help the user to write more efficient and effective code. The ML model can detect errors in the
user's code and suggest corrections. This can save the user time and effort in debugging their code.
The ML model can use contextual information to provide more relevant suggestions. For example,
it can suggest code snippets that are relevant to the programming language, library or framework
being used. The ML model can learn from the user's coding style and preferences, and provide
personalized suggestions that are tailored to their needs. The ML model can use natural language
processing (NLP) techniques to understand the user's code comments and provide relevant
suggestions.
Fig 6.2.2: Machine Learning Language Model
The data analysis component is responsible for analyzing the data given by the user to identify
any issues such as syntax errors, logical errors, and other potential problems. The data analysis
process consists of three main stages: lexical analysis, syntax analysis, and semantic analysis. The
Lexical Analysis component is responsible for breaking the code down into its fundamental
building blocks, or tokens, such as keywords, identifiers, and literals. This stage involves removing
any whitespace or comments from the code. The Syntax Analysis component is responsible for
analyzing the structure of the code and checking that it conforms to the rules of the programming
language being used. This involves checking that the code is syntactically correct and free from
any syntax errors. The Semantic Analysis component is responsible for analyzing the meaning of
the code and checking that it conforms to the rules of the programming language being used. This
involves checking that the code is semantically correct and free from any logical errors.
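To make the lexical and syntax stages concrete, the sketch below runs Python's standard tokenize and ast modules over a one-line snippet; this is an illustration of the idea, not the project's own analyzer.

import ast
import io
import tokenize

source = "total = price * quantity\n"

# Lexical analysis: break the source into tokens (identifiers, operators, ...).
tokens = tokenize.generate_tokens(io.StringIO(source).readline)
print([(token.type, token.string) for token in tokens if token.string.strip()])

# Syntax analysis: build a parse tree; a SyntaxError means the code does not
# conform to the grammar of the language.
try:
    tree = ast.parse(source)
    print(ast.dump(tree))
except SyntaxError as error:
    print("Syntax error:", error)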
Fig 6.2.3: Data analysis
The NLP component is responsible for processing the natural language input of the user to identify
the intent and provide appropriate responses. The NLP process consists of several stages, including
text preprocessing, text representation, model training, and model inference. The Text
Preprocessing component is responsible for cleaning and normalizing the user's natural language
input. This involves removing any irrelevant information, such as stopwords and punctuation, and
converting the text to a standardized format. The Text Representation component is responsible
for converting the preprocessed text into a format that can be understood by the machine learning
models. This typically involves vectorizing the text into a numerical format, such as a bag-of-
words or word embeddings. The Model Training component is responsible for training the
machine learning models on a dataset of labeled examples. The models can be trained using various
algorithms, such as decision trees, random forests, or neural networks. The Model Inference
component is responsible for applying the trained machine learning models to the user's natural
language input and providing appropriate responses. This involves predicting the intent of the
user's input and generating a response based on the predicted intent.
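A compact sketch of that preprocessing, representation, training, and inference flow using a bag-of-words representation follows; the tiny labeled intent dataset is hypothetical and exists only to make the pipeline runnable.

from sklearn.feature_extraction.text import CountVectorizer
from sklearn.linear_model import LogisticRegression
from sklearn.pipeline import make_pipeline

# Hypothetical labeled examples mapping user text to an intent.
texts = ["explain this function", "fix the bug in my loop",
         "explain the error message", "fix this syntax error"]
intents = ["explain", "fix", "explain", "fix"]

# Text representation (bag-of-words) plus model training in one pipeline.
pipeline = make_pipeline(CountVectorizer(stop_words="english"), LogisticRegression())
pipeline.fit(texts, intents)

# Model inference: predict the intent of new input.
print(pipeline.predict(["please fix my code"]))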
Fig 6.2.4: Natural Language Processing
CHAPTER 7
TESTING
7.1 INTRODUCTION
● The primary objective of testing is to find defects or errors in the software, product, or
system being tested. This includes identifying bugs, issues, and other problems that can
affect the performance, functionality, and user experience of the software.
● Testing is also used to ensure that the software or system being tested performs the
functions and operations it is designed to do. This includes verifying that all features and
components work as expected and meet the specified requirements.
● Testing can also help improve the overall quality of the software or system by identifying
areas for improvement and suggesting ways to optimize performance, usability, and
security.
● Testing can help reduce the risk of errors, failures, and security breaches by identifying
potential issues before they become critical problems. This can help prevent downtime,
data loss, and other negative consequences.
7.3 TYPES OF TESTING
Unit testing is a type of software testing that focuses on testing individual units or components of
code. In unit testing, each unit of code is tested in isolation to verify that it behaves as expected
and meets the specified requirements. The main objective of unit testing is to ensure that each unit
of code works correctly and integrates smoothly with other units of code. This helps to identify
any defects or errors early in the development cycle, before they become more difficult and
expensive to fix.
Features to be tested:
● Verify that the entries are of the correct format.
● No duplicate entries should be allowed.
● All links should take the user to the correct page.
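As a hedged illustration of how such checks could be written as unit tests with pytest, the sketch below uses a hypothetical validate_age helper standing in for the application's input-format validation.

import pytest

def validate_age(value):
    # Hypothetical helper: accept an entry only if it is a number in a sensible range.
    age = int(value)                       # raises ValueError for non-numeric input
    if not 1 <= age <= 120:
        raise ValueError("age out of range")
    return age

def test_valid_entry_is_accepted():
    assert validate_age("45") == 45

def test_wrong_format_is_rejected():
    with pytest.raises(ValueError):
        validate_age("forty-five")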
Integration testing is a type of software testing that focuses on verifying the interfaces and
interactions between different modules or components of a software system. The goal of
integration testing is to identify defects and issues that may arise when different modules or
components are combined and to ensure that they work together as expected. In integration testing,
individual units or modules are combined and tested as a group to verify that they work together
correctly. This involves testing the interfaces and interactions between modules, as well as the
functionality of the system as a whole.
Integration testing can be performed using a variety of techniques, including top-down, bottom-
up, or a combination of both. In top-down integration testing, higher-level modules are tested first,
followed by lower-level modules. In bottom-up integration testing, lower-level modules are tested
first, followed by higher-level modules. In a combination approach, integration testing is
performed in stages, with some modules tested together and others tested individually. Integration
testing helps to identify defects early in the development cycle, when they are less expensive and
easier to fix. Integration testing helps to ensure that the software components or modules work
correctly and integrate smoothly, improving overall software quality. Integration testing helps to
increase confidence in the software by verifying that it meets the specified requirements and
performs as expected.
Test Results: All the test cases mentioned above passed successfully. No defects were
encountered.
CHAPTER 8
8.1 CONCLUSION
It is concluded that the system works well and fulfills the end user's requirements. The system has
been tested and errors have been removed. Heart disease is one of the leading causes of death
worldwide, and early prediction of heart disease is important. The project aims at predicting heart
disease using the KNN algorithm with the help of an Android application. The probability of
disease is found using certain datasets and the input given by the user; the system also provides
nearby hospital details and notifies the patient about the disease through a messaging application.
APPENDIX-1
SOURCE CODE
App.py
import os
import pickle

import streamlit as st
from streamlit_option_menu import option_menu

# Load the pre-trained models. The file names and the 'saved_models' folder are
# assumptions; adjust them to wherever the pickled models are actually stored.
working_dir = os.path.dirname(os.path.abspath(__file__))
diabetes_model = pickle.load(open(os.path.join(working_dir, 'saved_models', 'diabetes_model.sav'), 'rb'))
heart_disease_model = pickle.load(open(os.path.join(working_dir, 'saved_models', 'heart_disease_model.sav'), 'rb'))
parkinsons_model = pickle.load(open(os.path.join(working_dir, 'saved_models', 'parkinsons_model.sav'), 'rb'))

# sidebar for navigation between the three prediction pages
# (the menu title string is an assumption)
with st.sidebar:
    selected = option_menu('Multiple Disease Prediction System',
                           ['Diabetes Prediction',
                            'Heart Disease Prediction',
                            'Parkinsons Prediction'],
                           menu_icon='hospital-fill',
                           icons=['activity', 'heart', 'person'],
                           default_index=0)

# Diabetes Prediction Page
if selected == 'Diabetes Prediction':

    # page title
    st.title('Diabetes Prediction using ML')

    # getting the input data from the user
    col1, col2, col3 = st.columns(3)

    with col1:
        Pregnancies = st.text_input('Number of Pregnancies')
    with col2:
        Glucose = st.text_input('Glucose Level')
    with col3:
        BloodPressure = st.text_input('Blood Pressure value')
    with col1:
        SkinThickness = st.text_input('Skin Thickness value')
    with col2:
        Insulin = st.text_input('Insulin Level')
    with col3:
        BMI = st.text_input('BMI value')
    with col1:
        DiabetesPedigreeFunction = st.text_input('Diabetes Pedigree Function value')
    with col2:
        Age = st.text_input('Age of the Person')

    # code for Prediction (the button label is an assumption)
    diab_diagnosis = ''
    if st.button('Diabetes Test Result'):
        user_input = [Pregnancies, Glucose, BloodPressure, SkinThickness,
                      Insulin, BMI, DiabetesPedigreeFunction, Age]
        user_input = [float(x) for x in user_input]

        diab_prediction = diabetes_model.predict([user_input])

        if diab_prediction[0] == 1:
            diab_diagnosis = 'The person is diabetic'
        else:
            diab_diagnosis = 'The person is not diabetic'

        st.success(diab_diagnosis)
# Heart Disease Prediction Page
if selected == 'Heart Disease Prediction':

    # page title
    st.title('Heart Disease Prediction using ML')

    col1, col2, col3 = st.columns(3)

    with col1:
        age = st.text_input('Age')
    with col2:
        sex = st.text_input('Sex')
    with col3:
        cp = st.text_input('Chest Pain types')
    with col1:
        trestbps = st.text_input('Resting Blood Pressure')
    with col2:
        chol = st.text_input('Serum Cholestoral in mg/dl')
    with col3:
        fbs = st.text_input('Fasting Blood Sugar > 120 mg/dl')
    with col1:
        restecg = st.text_input('Resting Electrocardiographic results')
    with col2:
        thalach = st.text_input('Maximum Heart Rate achieved')
    with col3:
        exang = st.text_input('Exercise Induced Angina')
    with col1:
        oldpeak = st.text_input('ST depression induced by exercise')
    with col2:
        slope = st.text_input('Slope of the peak exercise ST segment')
    with col3:
        ca = st.text_input('Major vessels colored by flourosopy')
    with col1:
        thal = st.text_input('thal: 0 = normal; 1 = fixed defect; 2 = reversable defect')

    # code for Prediction (the button label is an assumption)
    heart_diagnosis = ''
    if st.button('Heart Disease Test Result'):
        user_input = [age, sex, cp, trestbps, chol, fbs, restecg, thalach,
                      exang, oldpeak, slope, ca, thal]
        user_input = [float(x) for x in user_input]

        heart_prediction = heart_disease_model.predict([user_input])

        if heart_prediction[0] == 1:
            heart_diagnosis = 'The person is having heart disease'
        else:
            heart_diagnosis = 'The person does not have any heart disease'

        st.success(heart_diagnosis)
# Parkinson's Prediction Page
if selected == 'Parkinsons Prediction':

    # page title
    st.title("Parkinson's Disease Prediction using ML")

    col1, col2, col3, col4, col5 = st.columns(5)

    with col1:
        fo = st.text_input('MDVP:Fo(Hz)')
    with col2:
        fhi = st.text_input('MDVP:Fhi(Hz)')
    with col3:
        flo = st.text_input('MDVP:Flo(Hz)')
    with col4:
        Jitter_percent = st.text_input('MDVP:Jitter(%)')
    with col5:
        Jitter_Abs = st.text_input('MDVP:Jitter(Abs)')
    with col1:
        RAP = st.text_input('MDVP:RAP')
    with col2:
        PPQ = st.text_input('MDVP:PPQ')
    with col3:
        DDP = st.text_input('Jitter:DDP')
    with col4:
        Shimmer = st.text_input('MDVP:Shimmer')
    with col5:
        Shimmer_dB = st.text_input('MDVP:Shimmer(dB)')
    with col1:
        APQ3 = st.text_input('Shimmer:APQ3')
    with col2:
        APQ5 = st.text_input('Shimmer:APQ5')
    with col3:
        APQ = st.text_input('MDVP:APQ')
    with col4:
        DDA = st.text_input('Shimmer:DDA')
    with col5:
        NHR = st.text_input('NHR')
    with col1:
        HNR = st.text_input('HNR')
    with col2:
        RPDE = st.text_input('RPDE')
    with col3:
        DFA = st.text_input('DFA')
    with col4:
        spread1 = st.text_input('spread1')
    with col5:
        spread2 = st.text_input('spread2')
    with col1:
        D2 = st.text_input('D2')
    with col2:
        PPE = st.text_input('PPE')

    # code for Prediction (the button label is an assumption)
    parkinsons_diagnosis = ''
    if st.button("Parkinson's Test Result"):
        user_input = [fo, fhi, flo, Jitter_percent, Jitter_Abs, RAP, PPQ, DDP,
                      Shimmer, Shimmer_dB, APQ3, APQ5, APQ, DDA, NHR, HNR,
                      RPDE, DFA, spread1, spread2, D2, PPE]
        user_input = [float(x) for x in user_input]

        parkinsons_prediction = parkinsons_model.predict([user_input])

        if parkinsons_prediction[0] == 1:
            parkinsons_diagnosis = "The person has Parkinson's disease"
        else:
            parkinsons_diagnosis = "The person does not have Parkinson's disease"

        st.success(parkinsons_diagnosis)
APPENDIX-2
SNAPSHOTS
Fig 9.3: Person does not have a heart attack
REFERENCES
2. https://www.researchgate.net/publication/350311935_Applying_CodeBERT_for_Automated_Program_Repair_of_Java_Simple_Bugs
3. Kuhail, M.A., Alturki, N., Alramlawi, S. et al. Interacting with educational chatbots:
A systematic review. Educ Inf Technol 28, 973–1018 (2023).
https://doi.org/10.1007/s10639-022-11177-3
PROGRAM OUTCOMES

PO1 (Engineering knowledge): Apply the knowledge of mathematics, science, engineering
fundamentals, and an engineering specialization for the solution of complex engineering problems.

PO2 (Problem analysis): Identify, formulate, research literature, and analyze complex engineering
problems reaching substantiated conclusions using first principles of mathematics, natural sciences,
and engineering sciences.

PO3 (Design/development of solutions): Design solutions for complex engineering problems and
design system components or processes that meet the specified needs with appropriate
consideration for public health and safety, and cultural, societal, and environmental considerations.

PO4 (Conduct investigations of complex problems): Use research-based knowledge and research
methods including design of experiments, analysis and interpretation of data, and synthesis of the
information to provide valid conclusions.

PO5 (Modern tool usage): Create, select, and apply appropriate techniques, resources, and modern
engineering and IT tools, including prediction and modeling, to complex engineering activities,
with an understanding of the limitations.

PO6 (The engineer and society): Apply reasoning informed by the contextual knowledge to assess
societal, health, safety, legal and cultural issues and the consequent responsibilities relevant to the
professional engineering practice.

PO7 (Environment and sustainability): Understand the impact of the professional engineering
solutions in societal and environmental contexts, and demonstrate the knowledge of, and need for,
sustainable development.

PO8 (Ethics): Apply ethical principles and commit to professional ethics and responsibilities and
norms of the engineering practice.

PO9 (Individual and team work): Function effectively as an individual, and as a member or leader
in diverse teams, and in multidisciplinary settings.

PO10 (Communication): Communicate effectively on complex engineering activities with the
engineering community and with society at large, such as being able to comprehend and write
effective reports and design documentation, make effective presentations, and give and receive
clear instructions.

PO11 (Project management and finance): Demonstrate knowledge and understanding of the
engineering and management principles and apply these to one's own work, as a member and
leader in a team, to manage projects and in multidisciplinary environments.

PO12 (Life-long learning): Recognize the need for, and have the preparation and ability to engage
in, independent and life-long learning in the broadest context of technological change.
Mapping of Program outcomes with the Project titled “HEART ATTACK RISK
PREDICTION USING MACHINE LEARNING.”
PO1 PO2 PO3 PO4 PO5 PO6 PO7 PO8 PO9 PO10 PO11 PO12
PROGRAM SPECIFIC OUTCOMES
B.E COMPUTER SCIENCE AND ENGINEERING
Signature of Guide