Perform sentiment analysis on a subset of a larger Twitter dataset. Identify trends, major topics of discussion, and their associated sentiments.
https://www.kaggle.com/datasets/kazanova/sentiment140
- There were 1600000 rows and 6 columns, no null values present in datasets.

- From the above graph , we can infer that distribution of negative and positive words are approximately same.
Defined a function toremove any url ,punctuation , @ marks from the twittts and lemmetized the text for further model development.
Developed a Naive Bayes , Logistic classification, Support Vector Machine and Random Forest classifier models , among which logistic classifier performed best with accuracy of 80%.

Also developed a RNN model for classifying any tweet as negative and positive , upon which we can make an app by which we can judge whether the tweet is negative or positive.The accuracy of the model came out to 77.28 percent which was quite good.

Also we have got some positive and negative trending words from datasets. By help of these word we could know what are possible trends which are going on in the world .
