This repository contains the source files of Talend Data Quality libraries.
| Project | Description |
|---|---|
| dataquality-common | Abstractions of data analysis, and low-level utilities such as East Asian text pattern recognition |
| dataquality-converter | Conversion tools for datetime, distance, japanese characters, etc. |
| dataquality-email | Email validation library |
| dataquality-libraries | Parent pom aggregating other library projects |
| dataquality-phone | Phone number validation and conversion tools |
| dataquality-record-linkage | Record Matching algorithms, blocking key calculation and T-Swoosh |
| dataquality-sampling | Reservoir sampling, data duplication |
| dataquality-standardization | Standardization library based on Apache Lucene |
| dataquality-statistics | API for data analysis and statistics |
| dataquality-survivorship | Data survivorship library based on Drools |
| dataquality-text-japanese | API for japanese text analysis |
| dataquality-wordnet | Content validation API based on WordNet dictionary |
Talend Open Studio for Data Quality can be download from the Talend website.
- All project are maven based.
- The parent pom builds all the libraries.
- Support JDK 17 from 17.0.1
Copyright (c) 2006-2023 Talend
Licensed under the Apache Licence v2