Implementation of 💍 Ring Attention, from Liu et al. at Berkeley AI, in Pytorch

Python 542 33 Updated May 16, 2025

PyMuPDF is a high performance Python library for data extraction, analysis, conversion & manipulation of PDF (and other) documents.

Python 8,316 653 Updated Oct 22, 2025

MindSpore is a new open source deep learning training/inference framework that could be used for mobile, edge and cloud scenarios.

C++ 4,626 746 Updated Jul 29, 2024

A series of large language models developed by Baichuan Intelligent Technology

Python 4,121 293 Updated Nov 8, 2024

Hackable and optimized Transformers building blocks, supporting a composable construction.

Python 10,026 727 Updated Oct 17, 2025

ggml implementation of the baichuan13b model (adapted from llama.cpp)

C 55 3 Updated Jul 27, 2023

Implementation of Nougat Neural Optical Understanding for Academic Documents

Python 9,688 619 Updated Feb 21, 2025

Fast and memory-efficient exact attention

Python 20,147 2,081 Updated Oct 24, 2025

CMMLU: Measuring massive multitask language understanding in Chinese

Python 790 64 Updated Dec 6, 2024

Chinese large language model evaluation, phase two

71 3 Updated Oct 23, 2023

A 13B large language model developed by Baichuan Intelligent Technology

Python 2,961 237 Updated Sep 6, 2023

The official GitHub page for the survey paper "A Survey of Large Language Models".

Python 11,897 930 Updated Mar 11, 2025

An open-source multimodal large language model based on baichuan-7b

Python 72 7 Updated Dec 7, 2023

A large-scale 7B pretraining language model developed by BaiChuan-Inc.

Python 5,684 506 Updated Jul 18, 2024

Gaokao Benchmark for AI

108 6 Updated Jul 8, 2022

OpenAssistant is a chat-based assistant that understands tasks, can interact with third-party systems, and retrieve information dynamically to do so.

Python 37,482 3,301 Updated Aug 17, 2024

A library for accelerating Transformer models on NVIDIA GPUs, including using 8-bit floating point (FP8) precision on Hopper, Ada and Blackwell GPUs, to provide better performance with lower memory…

Python 2,846 527 Updated Oct 24, 2025

Transformer related optimization, including BERT, GPT

C++ 6,331 921 Updated Mar 27, 2024

Train transformer language models with reinforcement learning.

Python 16,003 2,244 Updated Oct 24, 2025

Ongoing research training transformer models at scale

Python 13,940 3,184 Updated Oct 24, 2025

Collections of vector search related libraries, service and research papers

1,527 103 Updated Aug 6, 2024

Approximate nearest neighbor search with product quantization on GPU in pytorch and cuda

Cuda 228 22 Updated Dec 12, 2023

Automatically create Faiss knn indices with optimal similarity search parameters.

Python 873 79 Updated May 21, 2024

Library for 8-bit optimizers and quantization routines.

780 48 Updated Aug 18, 2022

Fast and memory-efficient clustering

Jupyter Notebook 262 44 Updated Oct 16, 2023

hnsw lib with hamming distance and uint32 coding

C++ 4 1 Updated Sep 29, 2019

DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.

Python 40,494 4,589 Updated Oct 24, 2025

Doppler data from TIANWEN-1

Jupyter Notebook 10 1 Updated Jan 28, 2023