Thanks to visit codestin.com
Credit goes to github.com

Skip to content

Zmunshaw/PortfolioSite

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

68 Commits
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Source Code for my portfolio site


It is still very much a work in progress but I'm using this as an opportunity to learn more about git and actually make something "in public" instead of deleting it once I'm done with it.


Backend

--

Backend for my portfolio site, public because is demo code, I guess.

Features

  • .NET EntityFramework

  • AI Driven Search

    • Dense vectors for semantic meaning on chunks of words within a document.
    • Sparse vectors for keywords, finding "like" keywords.
    • Exact Keyword + sparse L2(taxi cab) distance + dense Cosine distance scoring system.
    • [PLANNED] Inclusion of a PageRank adjecent algorythm for page authority.
  • Data Collection

    • Go and Python based spider
    • Sitemap traversal and extraction
    • Page content extraction
    • Data Sanitization

  • Data Embedding

    • Collected text from pages is converted into sparse and dense vectors
      • text is broken into words and converted to sprase embeddings using Splade for keyword searches.
      • then whole text is then split into large chunks and converted to dense embeddings using IBM Granite Embedding for context searches.

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published