Thanks to visit codestin.com
Credit goes to github.com

Skip to content

kbenoit/tosca

 
 

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

tosca

Tools for Statistical Content Analysis
created at TU Dortmund University.

About

tosca is a framework for statistical methods in content analysis. We offer a pipeline for preprocessing, model text corpora using a link to the implemantation of Latent Dirichlet Allocation from the lda package. Useful plot routines for both - pre- and post-modeled corpora - are given for the descriptive analysis of text corpora and topic models. Moreover, an implementation of Chang's intruder words and intruder topics is provided; as well as reasoned sampling of text ids to get effective sets of texts for human labeling/coding regarding accuracy of estimating Precision and Recall.

Installation

See examples how to use tosca at the Vignette.

Citation

For a BibTeX entry please use citation(package = "tosca").

Contribution

This R package is licensed under the GPLv3. For wishes, issues, and bugs please use the issue tracker.

Build Status Coverage Status CRAN Status Badge CRAN Downloads Total Downloads DOI

About

tools for statistical content analysis

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages

  • R 93.7%
  • HTML 6.3%