Starred repositories
A framework for prompt tuning using Intent-based Prompt Calibration
An API standard for single-agent reinforcement learning environments, with popular reference environments and related utilities (formerly Gym)
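A minimal sketch of the Gymnasium API this repo standardizes; the environment name "CartPole-v1" and the random policy are just illustrative choices:

```python
import gymnasium as gym

# Create an environment and run one episode with random actions.
env = gym.make("CartPole-v1")
observation, info = env.reset(seed=42)

done = False
while not done:
    action = env.action_space.sample()  # random policy, for illustration only
    observation, reward, terminated, truncated, info = env.step(action)
    done = terminated or truncated

env.close()
```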
[CVPR2024 Highlight][VideoChatGPT] ChatGPT with video understanding! Also supports many more LMs, such as MiniGPT-4, StableLM, and MOSS.
DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
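A rough sketch of the usual DeepSpeed pattern of wrapping a PyTorch model with an engine; the toy model, config values, and single training step are assumptions for illustration (and assume a CUDA machine launched via the deepspeed launcher), not the library's only usage:

```python
import torch
import deepspeed

# Toy model and config, for illustration only.
model = torch.nn.Linear(1024, 10)
ds_config = {
    "train_batch_size": 32,
    "zero_optimization": {"stage": 2},
    "optimizer": {"type": "Adam", "params": {"lr": 1e-3}},
}

# deepspeed.initialize returns an engine that manages distributed
# training, optimizer state partitioning (ZeRO), and the optimizer step.
model_engine, optimizer, _, _ = deepspeed.initialize(
    model=model, model_parameters=model.parameters(), config=ds_config
)

inputs = torch.randn(32, 1024).to(model_engine.device)
labels = torch.randint(0, 10, (32,)).to(model_engine.device)
loss = torch.nn.functional.cross_entropy(model_engine(inputs), labels)
model_engine.backward(loss)  # engine-managed backward pass
model_engine.step()          # engine-managed optimizer step
```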
A high-throughput and memory-efficient inference and serving engine for LLMs
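A minimal sketch of vLLM's offline inference API; the model name, prompt, and sampling settings below are placeholders:

```python
from vllm import LLM, SamplingParams

# Small model chosen purely as an example.
llm = LLM(model="facebook/opt-125m")
params = SamplingParams(temperature=0.8, top_p=0.95, max_tokens=64)

outputs = llm.generate(["The capital of France is"], params)
for output in outputs:
    print(output.outputs[0].text)  # generated continuation for each prompt
```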
A Chinese version of CLIP that achieves Chinese cross-modal retrieval and representation generation.
Reproducible scaling laws for contrastive language-image learning (https://arxiv.org/abs/2212.07143)
A 13B-parameter large language model developed by Baichuan Intelligent Technology
Code for "Learning to summarize from human feedback"
Reverse-engineered API of Microsoft's Bing Chat AI
Conceptual 12M is a dataset containing (image-URL, caption) pairs collected for vision-and-language pre-training.
[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.
mPLUG-Owl: The Powerful Multi-modal Large Language Model Family
Hackable and optimized Transformers building blocks, supporting a composable construction.
An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.
🤗 Transformers: the model-definition framework for state-of-the-art text, vision, audio, and multimodal machine learning models, for both inference and training.
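A minimal sketch of the library's high-level pipeline API; the task and input sentence are illustrative, and a specific model can also be named explicitly via the model= argument:

```python
from transformers import pipeline

# Without an explicit model, the pipeline falls back to a default
# checkpoint for the chosen task.
classifier = pipeline("sentiment-analysis")
print(classifier("Transformers makes state-of-the-art models easy to use."))
# e.g. [{'label': 'POSITIVE', 'score': 0.99...}]
```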
LAVIS - A One-stop Library for Language-Vision Intelligence
PyTorch code for BLIP: Bootstrapping Language-Image Pre-training for Unified Vision-Language Understanding and Generation
Implementation of Perceiver, General Perception with Iterative Attention, in PyTorch
An industrial deep learning framework for high-dimensional sparse data
SIGKDD'2022: Mixture of Virtual-Kernel Experts for Multi-Objective User Profile Modeling
Official implementation of the paper "Sparse Feature Factorization for Recommender Systems with Knowledge Graphs"
CLIP (Contrastive Language-Image Pretraining): predict the most relevant text snippet given an image
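A minimal sketch of that image-to-text matching with the clip package; the image path "example.jpg" and the two candidate captions are placeholders:

```python
import torch
import clip
from PIL import Image

device = "cuda" if torch.cuda.is_available() else "cpu"
model, preprocess = clip.load("ViT-B/32", device=device)

image = preprocess(Image.open("example.jpg")).unsqueeze(0).to(device)
text = clip.tokenize(["a photo of a dog", "a photo of a cat"]).to(device)

with torch.no_grad():
    # Cosine-similarity logits between the image and each caption.
    logits_per_image, logits_per_text = model(image, text)
    probs = logits_per_image.softmax(dim=-1)

print(probs)  # probability of each caption matching the image
```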
PyTorch code for "Unifying Vision-and-Language Tasks via Text Generation" (ICML 2021)
PyTorch Re-Implementation of "The Sparsely-Gated Mixture-of-Experts Layer" by Noam Shazeer et al. https://arxiv.org/abs/1701.06538
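A compact, self-contained sketch of the top-k gating idea behind the sparsely-gated MoE layer; the dimensions, expert MLPs, and dense per-expert loop are simplifications for illustration, not the repo's implementation (which dispatches sparsely and adds noisy gating plus load-balancing losses):

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class TopKMoE(nn.Module):
    """Route each token to its top-k experts and mix their outputs."""
    def __init__(self, dim, hidden, num_experts=4, k=2):
        super().__init__()
        self.gate = nn.Linear(dim, num_experts)
        self.experts = nn.ModuleList(
            nn.Sequential(nn.Linear(dim, hidden), nn.ReLU(), nn.Linear(hidden, dim))
            for _ in range(num_experts)
        )
        self.k = k

    def forward(self, x):                       # x: (batch, dim)
        logits = self.gate(x)                   # (batch, num_experts)
        topk_vals, topk_idx = logits.topk(self.k, dim=-1)
        weights = F.softmax(topk_vals, dim=-1)  # renormalize over selected experts
        out = torch.zeros_like(x)
        for slot in range(self.k):              # dense loop; real impls dispatch sparsely
            for e, expert in enumerate(self.experts):
                mask = topk_idx[:, slot] == e
                if mask.any():
                    out[mask] += weights[mask, slot, None] * expert(x[mask])
        return out

moe = TopKMoE(dim=16, hidden=32)
print(moe(torch.randn(8, 16)).shape)  # torch.Size([8, 16])
```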