Thanks to visit codestin.com
Credit goes to github.com

Skip to content

This repository consists of the basic techniques used in Data and Web Mining

License

Notifications You must be signed in to change notification settings

saxenism/TextAnalysisBasics

Repository files navigation

Overview

Following is the DWM Project implemented by the 5th Sem CSE Department of IIIT Bhubaneswar.

Features implemented:

  • Data Visualisation
    alt text
  • Data Cleaning
    alt text alt text
  • Tokenisation
  • Stemming
    alt text
  • Data Wrangling and Pre-processing
  • Creation of Term-incidence Matrix
    alt text
  • Creation of Tf-Idf Matrix
    alt text
  • Implementation of n-grams
    alt text
  • Implemented SVD to decrease the sparsity of our training matrix\
    • Table of Content alt text
    • Summary of the SVD Table alt text
  • Creation of Cosine similarity on the vector space. alt text

About

This repository consists of the basic techniques used in Data and Web Mining

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages