NLP_Basics

In the "Deep_learning_for_NLP.ipynb" file, I have tried to cover basics of NLP and followed the book titled "Deep Learning for Natural Language Processing". I will keep updating the current repo.....

Covers basic NLP models and techniques such as Count Vectorizer, TF-IDF, Word2Vec, embeddings, sentiment analysis, text classification, LSTM/BiLSTM, topic modeling, seq2seq modeling, and the basics of newer NLP libraries.

Multi-Class Text Classification Model Comparison and Selection: https://towardsdatascience.com/multi-class-text-classification-model-comparison-and-selection-5eb066197568


Natural Language Processing Performance Metrics [ppt]

NLP Metrics Timeline

Evaluation Metrics: Quick Notes

Average precision

  • Macro: average of per-sentence scores
  • Micro: corpus-level (sum the numerators and denominators over all hypothesis-reference pairs, then divide once); see the sketch below
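
A toy numeric sketch of the difference, using hypothetical matched/total n-gram counts for three hypothesis-reference pairs (the numbers are made up for illustration):

```python
# Hypothetical matched / total n-gram counts for three hypothesis-reference pairs
matches = [3, 8, 1]
totals = [4, 10, 5]

# Macro: score each sentence first, then average the sentence-level scores
macro = sum(m / t for m, t in zip(matches, totals)) / len(totals)  # (0.75 + 0.80 + 0.20) / 3 ≈ 0.58

# Micro: pool numerators and denominators over the whole corpus, then divide once
micro = sum(matches) / sum(totals)  # 12 / 19 ≈ 0.63

print(f"macro = {macro:.3f}, micro = {micro:.3f}")
```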

Machine Translation

  1. BLEU (BiLingual Evaluation Understudy)
    • Papineni 2002
    • 'Measures how many words overlap in a given translation when compared to a reference translation, giving higher scores to sequential words.' (precision-oriented, with a brevity penalty to penalize overly short candidates; see the NLTK sketch after this list)
    • Limitation:
      • Doesn't consider different types of errors (insertions, substitutions, synonyms, paraphrase, stems)
      • Designed to be a corpus measure, so it has undesirable properties when used for single sentences.
  2. GLEU (Google-BLEU)
    • Wu et al. 2016
    • Minimum of BLEU recall and precision applied to 1-, 2-, 3- and 4-grams
      • Recall: (number of matching n-grams) / (number of total n-grams in the target)
      • Precision: (number of matching n-grams) / (number of total n-grams in generated sequence)
    • Correlates well with the BLEU metric at the corpus level but does not have its drawbacks when used as a per-sentence reward objective.
    • Not to be confused with Generalized Language Evaluation Understanding or Generalized BLEU, also known as GLEU
      • Napoles et al. 2015's ACL paper: Ground Truth for Grammatical Error Correction Metrics
      • Napoles et al. 2016: GLEU Without Tuning
        • Minor adjustment required as the number of references increases.
      • A simple variant of BLEU, it hews much more closely to human judgements.
      • "In MT, an untranslated word or phrase is almost always an error, but in GEC, this is not the case."
        • GLEU: "computes n-gram precisions over the reference but assigns more weight to n-grams that have been correctly changed from the source."
      • Python code
  3. WER (Word Error Rate)
    • Levenshtein distance (edit distance) for words: minimum number of edits (insertions, deletions or substitutions) required to change the hypothesis sentence into the reference (a minimal Python implementation appears after this list).
    • Range: 0 (when the hypothesis matches the reference exactly) with no upper bound, since an Automatic Speech Recognizer (ASR) can insert an arbitrary number of words
    • $ WER = \frac{S+D+I}{N} = \frac{S+D+I}{S+D+C} $
      • S: number of substitutions, D: number of deletions, I: number of insertions, C: number of correct words, N: number of words in the reference ($N=S+D+C$)
    • WAcc (Word Accuracy) or Word Recognition Rate (WRR): $1 - WER$
    • Limitation: provides no details on the nature of translation errors
      • Different errors are treated equally, even though they might influence the outcome differently (being more disruptive or more difficult/easier to correct).
      • If you look at the formula, there's no distinction between a substitution error and a deletion followed by an insertion error.
    • Possible solution proposed by Hunt (1990):
      • Use of a weighted measure
      • $ WER = \frac{S+0.5D+0.5I}{N} $
      • Problem:
        • The metric is typically used to compare systems, so it is unclear whether Hunt's formula could be used to assess the performance of a single system
        • It is also unclear how effective this measure is in helping a user with error correction
    • See more info
  4. METEOR (Metric for Evaluation of Translation with Explicit ORdering)
    • Banerjee and Lavie 2005
    • Aligns unigrams between candidate and reference using exact, stemmed and synonym matches, then combines unigram precision and recall (weighted toward recall) with a fragmentation penalty for word order
  5. TER (Translation Edit Rate)
    • Snover et al. 2006's paper: A study of translation edit rate with targeted human annotation
    • Number of edits (word insertions, deletions, substitutions and shifts) required to change a machine translation so that it exactly matches the closest reference translation
    • TER = $\frac{E}{R}$ = (minimum number of edits) / (average length of reference text)
    • It is generally preferred to BLEU for estimation of sentence post-editing effort. Source.
    • PyTER
    • char-TER: character level TER
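
As referenced in the BLEU item above, here is a minimal sentence-level BLEU/GLEU sketch using NLTK's implementations (assumes pip install nltk; tokenization is a plain whitespace split for illustration):

```python
from nltk.translate.bleu_score import sentence_bleu, corpus_bleu, SmoothingFunction
from nltk.translate.gleu_score import sentence_gleu

references = ["the cat is on the mat".split()]  # one or more tokenized reference translations
hypothesis = "the cat sat on the mat".split()   # tokenized system output

# Sentence-level BLEU over 1- to 4-grams; smoothing avoids zero scores on short sentences
bleu = sentence_bleu(references, hypothesis,
                     weights=(0.25, 0.25, 0.25, 0.25),
                     smoothing_function=SmoothingFunction().method1)

# Sentence-level GLEU: minimum of n-gram precision and recall over 1- to 4-grams
gleu = sentence_gleu(references, hypothesis, min_len=1, max_len=4)

# Corpus-level BLEU pools n-gram counts over all sentence pairs before dividing
corpus = corpus_bleu([references], [hypothesis])

print(f"BLEU: {bleu:.3f}  GLEU: {gleu:.3f}  corpus BLEU: {corpus:.3f}")
```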
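
And a self-contained sketch of WER as word-level Levenshtein distance, a direct dynamic-programming rendering of the $\frac{S+D+I}{N}$ formula above (not a reference library implementation):

```python
def wer(reference: str, hypothesis: str) -> float:
    """Word Error Rate: (S + D + I) / N, computed via word-level edit distance."""
    ref, hyp = reference.split(), hypothesis.split()
    # d[i][j] = minimum edits to turn the first i reference words into the first j hypothesis words
    d = [[0] * (len(hyp) + 1) for _ in range(len(ref) + 1)]
    for i in range(len(ref) + 1):
        d[i][0] = i                              # only deletions
    for j in range(len(hyp) + 1):
        d[0][j] = j                              # only insertions
    for i in range(1, len(ref) + 1):
        for j in range(1, len(hyp) + 1):
            sub = 0 if ref[i - 1] == hyp[j - 1] else 1
            d[i][j] = min(d[i - 1][j] + 1,       # deletion
                          d[i][j - 1] + 1,       # insertion
                          d[i - 1][j - 1] + sub) # substitution or match
    return d[len(ref)][len(hyp)] / max(len(ref), 1)

# 1 substitution (sat -> sit) + 1 deletion (the) over 6 reference words ≈ 0.333
print(wer("the cat sat on the mat", "the cat sit on mat"))
```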

Summarization

  1. ROUGE (Recall-Oriented Understudy for Gisting Evaluation)
    • Lin 2004
    • Measures n-gram (ROUGE-N) and longest-common-subsequence (ROUGE-L) overlap between a system summary and reference summaries, with an emphasis on recall (a usage sketch follows)
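
A short usage sketch assuming Google's rouge-score package (pip install rouge-score); the package name and RougeScorer API are assumptions based on that library, so treat this as illustrative:

```python
from rouge_score import rouge_scorer

scorer = rouge_scorer.RougeScorer(["rouge1", "rouge2", "rougeL"], use_stemmer=True)

reference = "the police killed the gunman"
summary = "police kill the gunman"

# Each result holds precision, recall and F-measure; ROUGE is recall-oriented,
# so the recall field is the one most commonly reported
scores = scorer.score(reference, summary)
for name, s in scores.items():
    print(f"{name}: recall={s.recall:.3f} f1={s.fmeasure:.3f}")
```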

Image Caption Quality

  1. CIDEr (Consensus-based Image Description Evaluation)
    • Vedantam et al. 2015
    • TF-IDF-weighted n-gram cosine similarity between a candidate caption and the consensus of several reference captions, averaged over n = 1..4 (a simplified sketch follows)
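
A simplified, self-contained sketch of the CIDEr idea (TF-IDF-weighted n-gram cosine similarity against the consensus of reference captions, averaged over n = 1..4 and scaled by 10). It omits the length penalty and count clipping of the official CIDEr-D implementation, and the function names here are my own:

```python
from collections import Counter
import math

def ngrams(tokens, n):
    """Count the n-grams in a token list."""
    return Counter(tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1))

def cider_like(candidates, references, n_max=4):
    """Simplified CIDEr-style score: candidates is one tokenized caption per image,
    references is a list of tokenized reference captions per image."""
    num_images = len(references)
    per_n_scores = []
    for n in range(1, n_max + 1):
        # Document frequency: in how many images' reference sets each n-gram occurs
        df = Counter()
        for refs in references:
            seen = set()
            for ref in refs:
                seen.update(ngrams(ref, n))
            df.update(seen)

        def tfidf(counts):
            # Rare n-grams (low document frequency) get higher weight
            return {g: c * math.log(num_images / max(df[g], 1)) for g, c in counts.items()}

        image_scores = []
        for cand, refs in zip(candidates, references):
            c_vec = tfidf(ngrams(cand, n))
            c_norm = math.sqrt(sum(v * v for v in c_vec.values()))
            sims = []
            for ref in refs:
                r_vec = tfidf(ngrams(ref, n))
                r_norm = math.sqrt(sum(v * v for v in r_vec.values()))
                dot = sum(c_vec[g] * r_vec.get(g, 0.0) for g in c_vec)
                sims.append(dot / (c_norm * r_norm) if c_norm and r_norm else 0.0)
            image_scores.append(sum(sims) / len(sims))   # average over references (consensus)
        per_n_scores.append(sum(image_scores) / len(image_scores))
    return 10.0 * sum(per_n_scores) / n_max              # average over n, scaled by 10

cands = ["a dog runs on the grass".split(), "a cat sits on a mat".split()]
refs = [["a dog is running on grass".split(), "the dog runs across the lawn".split()],
        ["a cat is sitting on the mat".split(), "a small cat sits on a mat".split()]]
print(f"CIDEr-like score: {cider_like(cands, refs):.3f}")
```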