Thanks to visit codestin.com
Credit goes to github.com

Skip to content

souhaibdadi/DataQuality

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

3 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Dataquality :

Le traitement DataQuality permet de lancer des checks de formats de donnée. Le traitement se base sur Spark. Les checks sont donc parallele et distribués.

Exemple de configuration de compagne de qualité de donnée :

Pivot = {
  location = {type = "HBase", table = "dco_edma:InterlocuteurClient"},
  fieldsChecks = [
    {
      fieldName = "d:Nom"
      mandatory = true,
      checkes = [
        {id = "1", type = "regex", regex = "\"(?!MAIRE|DEPUTE|PRESIDENT|PDG|GERANT).+\""},
        {id = "2", type = "regex", regex = "\"(?!(\\s)).*"},
        {id = "3", type = "regex", regex = "\"(?!\\s)[A-Za-z\\s-]*\""},
        {id = "4", type = "regex", regex = "\"((?!M.\\s|MR.\\s|MAD.\\s|MME.\\s|MME.\\s|MLE.\\s).)+\""},
        {id = "5", type = "regex", regex = "\".*(?!(A supprimer)).*"},
        {id = "6", type = "regex", regex = "\".*(?!(A qualifier)).*"},
        {id = "7", type = "regex", regex = "\".*(?!(Néant)).*"}
      ]
    }
  ]
}

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published