maxsom/data-quality

 
 

Data Quality Libraries

This repository contains the source files of Talend Data Quality libraries.

Content structure

Project                      Description
dataquality-common           Abstractions of data analysis, and low-level utilities such as East Asian text pattern recognition
dataquality-converter        Conversion tools for datetime, distance, Japanese characters, etc.
dataquality-email            Email validation library
dataquality-libraries        Parent POM aggregating the other library projects
dataquality-phone            Phone number validation and conversion tools
dataquality-record-linkage   Record matching algorithms, blocking key calculation and T-Swoosh
dataquality-sampling         Reservoir sampling, data duplication
dataquality-standardization  Standardization library based on Apache Lucene
dataquality-statistics       API for data analysis and statistics
dataquality-survivorship     Data survivorship library based on Drools
dataquality-text-japanese    API for Japanese text analysis
dataquality-wordnet          Content validation API based on the WordNet dictionary
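
To illustrate the reservoir sampling technique named for dataquality-sampling, here is a minimal, generic sketch of the classic "Algorithm R". It is not the library's API: the class name ReservoirSketch and the method sample are hypothetical and exist only for this example. The idea is to keep the first k items, then replace a random slot with the n-th item with probability k/n.

```java
import java.util.ArrayList;
import java.util.Iterator;
import java.util.List;
import java.util.Random;

// Hypothetical illustration class; not part of the Talend Data Quality libraries.
final class ReservoirSketch {

    private ReservoirSketch() {
    }

    /**
     * Returns up to k items drawn uniformly at random from a stream whose
     * length is not known in advance (Algorithm R).
     */
    static <T> List<T> sample(Iterator<T> stream, int k, Random random) {
        if (k < 0) {
            throw new IllegalArgumentException("k must be non-negative");
        }
        List<T> reservoir = new ArrayList<>(k);
        long seen = 0;
        while (stream.hasNext()) {
            T item = stream.next();
            seen++;
            if (reservoir.size() < k) {
                // Fill phase: the first k items are always kept.
                reservoir.add(item);
            } else {
                // Pick an index in [0, seen); the item survives only if
                // the index falls inside the reservoir (probability k/seen).
                long j = (long) (random.nextDouble() * seen);
                if (j < k) {
                    reservoir.set((int) j, item);
                }
            }
        }
        return reservoir;
    }
}
```

Calling ReservoirSketch.sample(rows.iterator(), 10, new Random()) on any collection named rows would return at most 10 elements, and all of them if the collection holds fewer than 10.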

Product Download

Talend Open Studio for Data Quality can be downloaded from the Talend website.

Build

  • All projects are Maven-based.
  • The parent POM builds all the libraries.
  • JDK 17 is supported, starting from version 17.0.1.
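
As a small sketch of how the JDK requirement above could be checked before building, the following uses the standard Runtime.Version API (available since Java 9). The class name JdkCheckSketch and the method isSupported are hypothetical, and the check only enforces the 17.0.1 lower bound; whether newer major versions are supported is not stated here and is assumed for the example.

```java
// Hypothetical illustration class; not part of this repository's build.
final class JdkCheckSketch {

    // Minimum version taken from the build notes above.
    static final Runtime.Version MINIMUM = Runtime.Version.parse("17.0.1");

    private JdkCheckSketch() {
    }

    /** Returns true when the given version is at least 17.0.1. */
    static boolean isSupported(Runtime.Version version) {
        // Ignore optional build metadata when comparing.
        return version.compareToIgnoreOptional(MINIMUM) >= 0;
    }

    public static void main(String[] args) {
        Runtime.Version current = Runtime.version();
        System.out.println("Running on JDK " + current
                + (isSupported(current) ? " (meets the minimum)" : " (older than 17.0.1)"));
    }
}
```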

License

Copyright (c) 2006-2023 Talend

Licensed under the Apache License, Version 2.0

Languages

  • Java 99.7%
  • Other 0.3%