Miniproject NLP
A REPORT ON
SENTIMENT ANALYSIS USING BERT
TRANSFORMER
SUBMITTED BY
Problem Statement:
In today's digital world, a massive amount of textual data is generated daily through social media,
product reviews, news articles, and more. Understanding the sentiment behind this data is crucial for
businesses and researchers to make informed decisions. Traditional machine learning approaches
often fall short in capturing context and semantics effectively. Hence, there is a need for a more
robust, context-aware model like BERT (Bidirectional Encoder Representations from Transformers) to
accurately perform sentiment analysis on textual data.
Objectives:
Outcomes:
1. Successfully built and fine-tuned a BERT-based sentiment analysis model on a publicly
available dataset of Google Play app reviews.
2. Visualized results using a confusion matrix and a classification report (precision, recall,
and F1-score), showing where the model performs well and where it still struggles with
ambiguous, neutral inputs.
Code:
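If running in a fresh environment such as Google Colab, the Hugging Face library must be installed first (this install cell is assumed here; it was not part of the original listing):
!pip install -q transformers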
import transformers
from transformers import BertModel, BertTokenizer, AdamW, get_linear_schedule_with_warmup
import torch
from torch import nn
import torch.nn.functional as F
from torch.utils.data import Dataset, DataLoader
import numpy as np
import pandas as pd
import seaborn as sns
from pylab import rcParams
import matplotlib.pyplot as plt
from matplotlib import rc
from sklearn.model_selection import train_test_split
from sklearn.metrics import confusion_matrix, classification_report
from collections import defaultdict
from textwrap import wrap
%matplotlib inline
Data Exploration
In [0]:
df = pd.read_csv("reviews.csv")
df.head()
Out[0]:
[df.head() output: the first five rows of the Google Play app reviews dataset. Columns:
userName, userImage, content, score, thumbsUpCount, reviewCreatedVersion, at,
replyContent, repliedAt, sortOrder, appId.]
df.shape
(15746, 11)
df.info()
<class 'pandas.core.frame.DataFrame'>
RangeIndex: 15746 entries, 0 to 15745
Data columns (total 11 columns):
# Column Non-Null Count Dtype
--- ------ -------------- -----
0 userName 15746 non-null object
1 userImage 15746 non-null object
2 content 15746 non-null object
3 score 15746 non-null int64
4 thumbsUpCount 15746 non-null int64
5 reviewCreatedVersion 13533 non-null object
6 at 15746 non-null object
7 replyContent 7367 non-null object
8 repliedAt 7367 non-null object
9 sortOrder 15746 non-null object
10 appId 15746 non-null object
dtypes: int64(2), object(9)
memory usage: 1.3+ MB
In [0]:
sns.countplot(df.score)
plt.xlabel('review score');
In [0]:
def to_sentiment(rating):
  rating = int(rating)
  if rating <= 2:
    return 0
  elif rating == 3:
    return 1
  else:
    return 2

df['sentiment'] = df.score.apply(to_sentiment)
In [0]:
class_names = ['negative', 'neutral', 'positive']
In [0]:
ax = sns.countplot(df.sentiment)
plt.xlabel('review sentiment')
ax.set_xticklabels(class_names);
Data Preprocessing
• Add special tokens to separate sentences and do classification
• Pass sequences of constant length (introduce padding)
• Create an array of 0s (pad tokens) and 1s (real tokens) called the attention mask
All three steps are handled by the tokenizer's encode_plus() method, shown below.
BERT
We are using BERT BASE CASED, i.e. the case-sensitive BERT BASE model with 12 stacked
Transformer encoder layers.
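As a quick sanity check (an illustrative snippet, not part of the original notebook), the pretrained configuration can be inspected to confirm this architecture:
from transformers import BertConfig
config = BertConfig.from_pretrained('bert-base-cased')
# Expect 12 encoder layers, 12 attention heads, 768-dimensional hidden states
print(config.num_hidden_layers, config.num_attention_heads, config.hidden_size)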
In [0]:
PRE_TRAINED_MODEL_NAME = 'bert-base-cased'
In [0]:
tokenizer = BertTokenizer.from_pretrained(PRE_TRAINED_MODEL_NAME)
Special Tokens
[SEP] - marker for the end of a sentence
[CLS] - we must add this token to the start of each sentence, so BERT knows we're doing
classification
There is also a special token for padding: [PAD]
BERT understands only tokens that were in its training vocabulary. Everything else can be
encoded using the [UNK] (unknown) token.
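These special tokens and their vocabulary ids can be inspected directly (a short illustrative snippet; the ids shown in the comments hold for bert-base-cased):
print(tokenizer.sep_token, tokenizer.sep_token_id) # [SEP] 102
print(tokenizer.cls_token, tokenizer.cls_token_id) # [CLS] 101
print(tokenizer.pad_token, tokenizer.pad_token_id) # [PAD] 0
print(tokenizer.unk_token, tokenizer.unk_token_id) # [UNK] 100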
All of that work can be done using the encode_plus() method:
In [0]:
sample_txt = 'When was I last outside? I am stuck at home for 2 weeks.'
encoding = tokenizer.encode_plus(
  sample_txt,
  max_length=32,
  add_special_tokens=True, # Add '[CLS]' and '[SEP]'
  return_token_type_ids=False,
  pad_to_max_length=True,
  return_attention_mask=True, # returns 0 for padding tokens
  return_tensors='pt', # Return PyTorch tensors
)
encoding.keys()
Out[0]:
dict_keys(['input_ids', 'attention_mask'])
The token ids are now stored in a Tensor and padded to a length of 32:
In [0]:
print(len(encoding['input_ids'][0]))
encoding['input_ids'][0]
32
Out[0]:
tensor([ 101, 1332, 1108,  146, 1314, 1796,  136,  146, 1821, 5342, 1120, 1313,
        1111,  123, 2277,  119,  102,    0,    0,    0,    0,    0,    0,    0,
           0,    0,    0,    0,    0,    0,    0,    0])
The attention mask has the same length:
In [0]:
print(len(encoding['attention_mask'][0]))
encoding['attention_mask']
32
Out[0]:
tensor([[1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 0, 0, 0, 0, 0, 0, 0,
         0, 0, 0, 0, 0, 0, 0, 0]])
We can invert the tokenization to have a look at the special tokens:
In [0]:
tokenizer.convert_ids_to_tokens(encoding['input_ids'][0])
Out[0]:
['[CLS]',
'When',
'was',
'I',
'last',
'outside',
'?',
'I',
'am',
'stuck',
'at',
'home',
'for',
'2',
'weeks',
'.',
'[SEP]',
'[PAD]',
'[PAD]',
'[PAD]',
'[PAD]',
'[PAD]',
'[PAD]',
'[PAD]',
'[PAD]',
'[PAD]',
'[PAD]',
'[PAD]',
'[PAD]',
'[PAD]',
'[PAD]',
'[PAD]']
In [0]:
token_lens = []
for txt in df.content:
  tokens = tokenizer.encode(txt, max_length=512)
  token_lens.append(len(tokens))
sns.distplot(token_lens)
plt.xlim([0, 256]);
plt.xlabel('Token count');
Most of the reviews seem to contain fewer than 128 tokens, but to be on the safe side we'll
choose a maximum length of 160.
In [0]:
MAX_LEN = 160
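To check that 160 is generous enough, one can compute the fraction of reviews that would be truncated (a quick supplementary check, assuming the token_lens list computed above):
# Fraction of reviews longer than MAX_LEN tokens; expected to be small
print(np.mean(np.array(token_lens) > MAX_LEN))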
In [0]:
class GPReviewDataset(Dataset):
  def __init__(self, reviews, targets, tokenizer, max_len):
    self.reviews = reviews
    self.targets = targets
    self.tokenizer = tokenizer
    self.max_len = max_len

  def __len__(self):
    return len(self.reviews)

  def __getitem__(self, item):
    review = str(self.reviews[item])
    target = self.targets[item]
    encoding = self.tokenizer.encode_plus(
      review,
      add_special_tokens=True,
      max_length=self.max_len,
      return_token_type_ids=False,
      pad_to_max_length=True,
      return_attention_mask=True,
      return_tensors='pt',
    )
    return {
      'review_text': review,
      'input_ids': encoding['input_ids'].flatten(),
      'attention_mask': encoding['attention_mask'].flatten(),
      'targets': torch.tensor(target, dtype=torch.long)
    }
The tokenizer is doing most of the heavy lifting for us. We also return the review texts, so it'll be
easier to evaluate the predictions from our model. Let's split the data:
In [0]:
df_train, df_test = train_test_split(df, test_size=0.1, random_state=42)
df_val, df_test = train_test_split(df_test, test_size=0.5, random_state=42)

def create_data_loader(df, tokenizer, max_len, batch_size):
  ds = GPReviewDataset(df.content.to_numpy(), df.sentiment.to_numpy(), tokenizer, max_len)
  return DataLoader(
    ds,
    batch_size=batch_size,
    num_workers=4
  )
In [0]:
BATCH_SIZE = 16
train_data_loader = create_data_loader(df_train, tokenizer, MAX_LEN, BATCH_SIZE)
val_data_loader = create_data_loader(df_val, tokenizer, MAX_LEN, BATCH_SIZE)
test_data_loader = create_data_loader(df_test, tokenizer, MAX_LEN, BATCH_SIZE)
data = next(iter(train_data_loader))
data.keys()
Out[0]:
dict_keys(['review_text', 'input_ids', 'attention_mask', 'targets'])
In [0]:
print(data['input_ids'].shape)
print(data['attention_mask'].shape)
print(data['targets'].shape)
torch.Size([16, 160])
torch.Size([16, 160])
torch.Size([16])
bert_model = BertModel.from_pretrained(PRE_TRAINED_MODEL_NAME)
In [0]:
# Note: in transformers v4+, pass return_dict=False to unpack a tuple like this
last_hidden_state, pooled_output = bert_model(
  input_ids=encoding['input_ids'],
  attention_mask=encoding['attention_mask']
)
last_hidden_state.shape
Out[0]:
torch.Size([1, 32, 768])
bert_model.config.hidden_size
Out[0]:
768
This is the dimensionality of the hidden representation BERT produces for each token (and the width of the encoder's feed-forward sub-layers).
pooled_output.shape
Out[0]:
torch.Size([1, 768])
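The classifier definition itself was elided from this report. Below is a minimal reconstruction of the standard setup for this kind of fine-tuning: dropout applied to BERT's pooled [CLS] output, followed by a linear layer over the three sentiment classes. The dropout rate of 0.3 is an assumption, not taken from the original listing:
class SentimentClassifier(nn.Module):
  def __init__(self, n_classes):
    super().__init__()
    self.bert = BertModel.from_pretrained(PRE_TRAINED_MODEL_NAME)
    self.drop = nn.Dropout(p=0.3) # assumed rate; regularizes the pooled output
    self.out = nn.Linear(self.bert.config.hidden_size, n_classes)

  def forward(self, input_ids, attention_mask):
    # pooled_output is the [CLS] representation after BERT's pooler
    # (in transformers v4+, pass return_dict=False to unpack a tuple like this)
    _, pooled_output = self.bert(
      input_ids=input_ids,
      attention_mask=attention_mask
    )
    return self.out(self.drop(pooled_output))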
In [0]:
device = torch.device("cuda:0" if torch.cuda.is_available() else "cpu")
model = SentimentClassifier(len(class_names))
model = model.to(device)
We'll move the example batch of our training data to the GPU:
In [0]:
input_ids = data['input_ids'].to(device)
attention_mask = data['attention_mask'].to(device)
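Applying the still-untrained classifier to this batch and softmax-ing the logits gives per-class probabilities (an illustrative one-liner, not in the original listing):
# Convert raw logits to probabilities; each row sums to 1
F.softmax(model(input_ids, attention_mask), dim=1)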
Training
We'll use the AdamW optimizer provided by Hugging Face, which fixes how weight decay
interacts with Adam, following the original AdamW paper. We'll also use a linear learning-rate
scheduler with no warmup steps:
In [0]:
EPOCHS = 10
optimizer = AdamW(model.parameters(), lr=2e-5, correct_bias=False)
total_steps = len(train_data_loader) * EPOCHS
scheduler = get_linear_schedule_with_warmup(
  optimizer,
  num_warmup_steps=0,
  num_training_steps=total_steps
)
loss_fn = nn.CrossEntropyLoss().to(device)
In [0]:
def train_epoch(model, data_loader, loss_fn, optimizer, device, scheduler, n_examples):
  model = model.train()
  losses = []
  correct_predictions = 0
  for d in data_loader:
    input_ids = d["input_ids"].to(device)
    attention_mask = d["attention_mask"].to(device)
    targets = d["targets"].to(device)
    outputs = model(
      input_ids=input_ids,
      attention_mask=attention_mask
    )
    _, preds = torch.max(outputs, dim=1)
    loss = loss_fn(outputs, targets)
    correct_predictions += torch.sum(preds == targets)
    losses.append(loss.item())
    loss.backward()
    nn.utils.clip_grad_norm_(model.parameters(), max_norm=1.0)
    optimizer.step()
    scheduler.step()
    optimizer.zero_grad()
  return correct_predictions.double() / n_examples, np.mean(losses)
In [0]:
def eval_model(model, data_loader, loss_fn, device, n_examples):
  model = model.eval()
  losses = []
  correct_predictions = 0
  with torch.no_grad():
    for d in data_loader:
      input_ids = d["input_ids"].to(device)
      attention_mask = d["attention_mask"].to(device)
      targets = d["targets"].to(device)
      outputs = model(
        input_ids=input_ids,
        attention_mask=attention_mask
      )
      _, preds = torch.max(outputs, dim=1)
      loss = loss_fn(outputs, targets)
      correct_predictions += torch.sum(preds == targets)
      losses.append(loss.item())
  return correct_predictions.double() / n_examples, np.mean(losses)
%%time
history = defaultdict(list)
best_accuracy = 0
for epoch in range(EPOCHS):
  print(f'Epoch {epoch + 1}/{EPOCHS}')
  print('-' * 10)
  train_acc, train_loss = train_epoch(model, train_data_loader, loss_fn, optimizer, device, scheduler, len(df_train))
  print(f'Train loss {train_loss} accuracy {train_acc}')
  val_acc, val_loss = eval_model(model, val_data_loader, loss_fn, device, len(df_val))
  print(f'Val loss {val_loss} accuracy {val_acc}')
  history['train_acc'].append(train_acc)
  history['train_loss'].append(train_loss)
  history['val_acc'].append(val_acc)
  history['val_loss'].append(val_loss)
  if val_acc > best_accuracy:
    torch.save(model.state_dict(), 'best_model_state.bin')
    best_accuracy = val_acc
Epoch 2/10
----------
Train loss 0.4158683338330777 accuracy 0.8420012701997036
Val loss 0.5365073362737894 accuracy 0.832274459974587
Epoch 3/10
----------
Train loss 0.24015077009679367 accuracy 0.922023851527768
Val loss 0.5074492372572422 accuracy 0.8716645489199493
Epoch 4/10
----------
Train loss 0.16012676668187295 accuracy 0.9546962105708843
Val loss 0.6009970247745514 accuracy 0.8703939008894537
Epoch 5/10
----------
Train loss 0.11209654617575301 accuracy 0.9675393409074872
Val loss 0.7367783848941326 accuracy 0.8742058449809403
Epoch 6/10
----------
Train loss 0.08572274737026433 accuracy 0.9764307388328276
Val loss 0.7251267762482166 accuracy 0.8843710292249047
Epoch 7/10
----------
Train loss 0.06132202987342602 accuracy 0.9833462705525369
Val loss 0.7083295831084251 accuracy 0.889453621346887
Epoch 8/10
----------
Train loss 0.050604159273123096 accuracy 0.9849693035071626
Val loss 0.753860274553299 accuracy 0.8907242693773825
Epoch 9/10
----------
Train loss 0.04373276197092931 accuracy 0.9862395032107826
Val loss 0.7506809896230697 accuracy 0.8919949174078781
Epoch 10/10
----------
Train loss 0.03768671146314381 accuracy 0.9880036694658105
Val loss 0.7431786182522774 accuracy 0.8932655654383737
CPU times: user 29min 54s, sys: 13min 28s, total: 43min 23s
Wall time: 43min 43s
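The accuracy curves can be plotted from history to see the growing train/validation gap (a supplementary sketch, not in the original listing; accuracies are cast to float in case they are CUDA tensors):
# Training vs. validation accuracy per epoch
plt.plot([float(a) for a in history['train_acc']], label='train accuracy')
plt.plot([float(a) for a in history['val_acc']], label='validation accuracy')
plt.title('Training history')
plt.ylabel('Accuracy')
plt.xlabel('Epoch')
plt.legend()
plt.ylim([0, 1]);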
In [0]:
test_acc, _ = eval_model(model, test_data_loader, loss_fn, device, len(df_test))
test_acc.item()
Out[0]:
0.883248730964467
In [0]:
def get_predictions(model, data_loader):
  model = model.eval()
  review_texts = []
  predictions = []
  prediction_probs = []
  real_values = []
  with torch.no_grad():
    for d in data_loader:
      texts = d["review_text"]
      input_ids = d["input_ids"].to(device)
      attention_mask = d["attention_mask"].to(device)
      targets = d["targets"].to(device)
      outputs = model(
        input_ids=input_ids,
        attention_mask=attention_mask
      )
      _, preds = torch.max(outputs, dim=1)
      probs = F.softmax(outputs, dim=1) # class probabilities (missing in the original listing)
      review_texts.extend(texts)
      predictions.extend(preds)
      prediction_probs.extend(probs)
      real_values.extend(targets)
  predictions = torch.stack(predictions).cpu()
  prediction_probs = torch.stack(prediction_probs).cpu()
  real_values = torch.stack(real_values).cpu()
  return review_texts, predictions, prediction_probs, real_values
In [0]:
y_review_texts, y_pred, y_pred_probs, y_test = get_predictions(
model,
test_data_loader
)
Let's have a look at the classification report:
In [0]:
print(classification_report(y_test, y_pred, target_names=class_names))
precision recall f1-score support
In [0]:
def show_confusion_matrix(confusion_matrix):
  hmap = sns.heatmap(confusion_matrix, annot=True, fmt="d", cmap="Blues")
  hmap.yaxis.set_ticklabels(hmap.yaxis.get_ticklabels(), rotation=0, ha='right')
  hmap.xaxis.set_ticklabels(hmap.xaxis.get_ticklabels(), rotation=30, ha='right')
  plt.ylabel('True sentiment')
  plt.xlabel('Predicted sentiment');

cm = confusion_matrix(y_test, y_pred)
df_cm = pd.DataFrame(cm, index=class_names, columns=class_names)
show_confusion_matrix(df_cm)
This confirms that our model has difficulty classifying neutral reviews: it mistakes them for
negative and positive at roughly equal rates.
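One way to quantify this (a small supplementary check, assuming cm from above) is per-class recall, read off the diagonal of the confusion matrix:
# Recall per class: correct predictions / true instances of that class
recall_per_class = cm.diagonal() / cm.sum(axis=1)
for name, r in zip(class_names, recall_per_class):
  print(f'{name}: {r:.2f}')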
In [0]:
idx = 2 # index of an example test review to inspect (the choice is arbitrary)
review_text = y_review_texts[idx]
true_sentiment = y_test[idx]
pred_df = pd.DataFrame({
  'class_names': class_names,
  'values': y_pred_probs[idx]
})
In [0]:
print("\n".join(wrap(review_text)))
print()
print(f'True sentiment: {class_names[true_sentiment]}')
I used to use Habitica, and I must say this is a great step up. I'd
like to see more social features, such as sharing tasks - only one
person has to perform said task for it to be checked off, but only
giving that person the experience and gold. Otherwise, the price for
subscription is too steep, thus resulting in a sub-perfect score. I
could easily justify $0.99/month or eternal subscription for $15. If
that price could be met, as well as fine tuning, this would be easily
worth 5 stars.
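The model's predicted probability for each class can be visualized from pred_df (a sketch using the DataFrame built above):
# Horizontal bar plot of class probabilities for this review
sns.barplot(x='values', y='class_names', data=pred_df, orient='h')
plt.ylabel('sentiment')
plt.xlabel('probability')
plt.xlim([0, 1]);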
Conclusion:
This assignment demonstrates the effectiveness of the BERT Transformer for sentiment analysis.
Unlike traditional models, BERT captures context from both directions of a sentence, resulting in
more accurate sentiment predictions. The fine-tuned model reaches about 88% accuracy on the
held-out test set of Google Play reviews, with neutral reviews remaining the hardest class, and
proves to be a reliable approach for real-world sentiment classification problems.