Stars
Developer-friendly, embedded retrieval engine for multimodal AI. Search More; Manage Less.
A complement to pgvector for high performance, cost efficient vector search on large workloads.
Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities
📋 A list of open LLMs available for commercial use.
The open source developer platform to build AI/LLM applications and models with confidence. Enhance your AI applications with end-to-end tracking, observability, and evaluations, all in one integra…
NL-Augmenter 🦎 → 🐍 A Collaborative Repository of Natural Language Transformations
Contextual Encoder-Decoder Network for Visual Saliency Prediction [Neural Networks 2020]
Cleanlab's open-source library is the standard data-centric AI package for data quality and machine learning with messy, real-world data and labels.
Selection of most dominant colors in image using Modified Median Cut Quantization
Word2Vec implementation on Czech Wikipedia data
Scalable, Portable and Distributed Gradient Boosting (GBDT, GBRT or GBM) Library, for Python, R, Java, Scala, C++ and more. Runs on single machine, Hadoop, Spark, Dask, Flink and DataFlow
Largest multi-label image database; ResNet-101 model; 80.73% top-1 acc on ImageNet
How to use ELMo embeddings in Keras with Tensorflow Hub
Machine-readable lists of lemma-token pairs in 23 languages.