Source Code for my portfolio site
It is still very much a work in progress but I'm using this as an opportunity to learn more about git and actually make something "in public" instead of deleting it once I'm done with it.
--
Backend for my portfolio site, public because is demo code, I guess.
-
- Dense vectors for semantic meaning on chunks of words within a document.
- Sparse vectors for keywords, finding "like" keywords.
- Exact Keyword + sparse L2(taxi cab) distance + dense Cosine distance scoring system.
- [PLANNED] Inclusion of a PageRank adjecent algorythm for page authority.
-
- Collected text from pages is converted into sparse and dense vectors
- text is broken into words and converted to sprase embeddings using Splade for keyword searches.
- then whole text is then split into large chunks and converted to dense embeddings using IBM Granite Embedding for context searches.
- Collected text from pages is converted into sparse and dense vectors