Add FuzzySharp-based text analysis plugin for synonym detection, typo correction and entity extraction #1208
PR Compliance Guide 🔍 (Compliance updated until commit 3162be4)
Previous compliance checks: compliance check up to commit f3a0101
Please remove any business-related documents.
…ping to SynonymMapping
- Rename DomainTypes → Sources
- Rename DomainTermMapping → SynonymMapping
- Update namespace: BotSharp.Abstraction.FuzzSharp → BotSharp.Plugin.FuzzySharp.FuzzSharp
- Rename DomainTermMatcher.cs → SynonymMatcher.cs
PR Type
Enhancement
Description
Add FuzzySharp plugin for text analysis with synonym detection, typo correction, and entity extraction
Implement phrase collection abstraction with CSV-based vocabulary and synonym mapping loading
Create token matching system with priority-based matchers (synonym, exact, fuzzy)
Add n-gram processing and result deduplication for intelligent text analysis
Integrate plugin into solution with API endpoint and dependency injection
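The priority-based matching over n-grams described above can be illustrated with a rough sketch. This is not the plugin's code: it is written in Python rather than C#, uses the standard library's `difflib.SequenceMatcher` as a stand-in for FuzzySharp scoring, and every name in it (`VOCABULARY`, `SYNONYMS`, `Match`, `analyze`, the match-type labels, the 0.85 cutoff) is a hypothetical chosen for illustration.

```python
from dataclasses import dataclass
from difflib import SequenceMatcher

# Hypothetical sample data; the PR loads vocabulary and synonyms from CSV files.
VOCABULARY = {"credit card", "invoice", "refund"}
SYNONYMS = {"cc": "credit card", "bill": "invoice"}


@dataclass(frozen=True)
class Match:
    start: int          # index of the first token in the n-gram
    length: int         # number of tokens in the n-gram
    text: str           # the n-gram as it appeared in the input
    canonical: str      # the vocabulary term it resolved to
    match_type: str     # "synonym", "exact", or "typo" (labels are assumptions)
    confidence: float


def match_synonym(phrase):
    canonical = SYNONYMS.get(phrase)
    return (canonical, "synonym", 1.0) if canonical else None


def match_exact(phrase):
    return (phrase, "exact", 1.0) if phrase in VOCABULARY else None


def match_fuzzy(phrase, cutoff=0.85):
    # The cutoff is an arbitrary value for this sketch, not taken from the PR.
    best, best_score = None, 0.0
    for term in sorted(VOCABULARY):
        score = SequenceMatcher(None, phrase, term).ratio()
        if score > best_score:
            best, best_score = term, score
    return (best, "typo", best_score) if best_score >= cutoff else None


# Matchers are tried in priority order: synonym, then exact, then fuzzy.
MATCHERS = [match_synonym, match_exact, match_fuzzy]


def analyze(text, max_ngram=3):
    tokens = text.lower().split()
    matches, covered = [], set()
    # Longest n-grams first, so a multi-word phrase wins over its parts.
    for n in range(min(max_ngram, len(tokens)), 0, -1):
        for start in range(len(tokens) - n + 1):
            span = set(range(start, start + n))
            if span & covered:
                continue  # skip n-grams overlapping an accepted match
            phrase = " ".join(tokens[start:start + n])
            for matcher in MATCHERS:
                hit = matcher(phrase)
                if hit:
                    canonical, kind, confidence = hit
                    matches.append(Match(start, n, phrase, canonical, kind, confidence))
                    covered |= span
                    break  # the first matcher that hits takes priority
    return sorted(matches, key=lambda m: m.start)


for m in analyze("pay my bill with cc and get a refnd"):
    print(m.text, "->", m.canonical, f"({m.match_type})")
```

Skipping n-grams that overlap an already accepted span is one simple way to deduplicate results; the PR's result processor may use different rules.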
File Walkthrough
22 files
- Define phrase collection interface for vocabulary loading
- Define phrase service interface for text search
- Define search result model with match metadata
- Define match type constants for analysis results
- Define separator and tokenization character constants
- Add API endpoint for text analysis
- Define text analysis request parameters
- Define n-gram processing interface
- Define result processing interface for deduplication
- Define token matcher interface and context models
- Define flagged item model for matches
- Define text analysis response model
- Implement plugin registration and dependency injection
- Implement CSV-based vocabulary and synonym loading
- Implement exact match token matcher
- Implement fuzzy matching for typo correction
- Implement synonym matching with highest priority
- Implement phrase service orchestrating analysis pipeline
- Implement n-gram processing with matcher priority
- Implement result deduplication and sorting logic
- Add global using statements for plugin
- Implement text preprocessing and tokenization utilities

4 files
- Add FuzzySharp plugin project to solution
- Create FuzzySharp plugin project file
- Add FuzzySharp plugin project reference
- Register FuzzySharp plugin in application settings

1 file
- Add CsvHelper and FuzzySharp package dependencies
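
The CSV-based vocabulary and synonym loading listed in the walkthrough might look roughly like the sketch below. It uses Python's standard `csv` module in place of CsvHelper, and the column names (`term`, `source`, `synonym`, `canonical`) and sample rows are assumptions, since the PR page does not show the actual file layout.

```python
import csv
import io

# Hypothetical CSV contents; real files and column names may differ.
VOCAB_CSV = "term,source\ncredit card,payments\ninvoice,billing\n"
SYNONYM_CSV = "synonym,canonical,source\ncc,credit card,payments\nbill,invoice,billing\n"


def load_vocabulary(stream):
    """Group normalized vocabulary terms by their source."""
    vocab = {}
    for row in csv.DictReader(stream):
        term = row["term"].strip().lower()
        if term:
            vocab.setdefault(row["source"].strip(), set()).add(term)
    return vocab


def load_synonyms(stream):
    """Map each normalized synonym to its (canonical term, source) pair."""
    mapping = {}
    for row in csv.DictReader(stream):
        key = row["synonym"].strip().lower()
        if key:
            mapping[key] = (row["canonical"].strip(), row["source"].strip())
    return mapping


vocabulary = load_vocabulary(io.StringIO(VOCAB_CSV))
synonyms = load_synonyms(io.StringIO(SYNONYM_CSV))
```

Lowercasing keys at load time keeps lookups case-insensitive, provided the analysis step normalizes input tokens the same way.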