Highlights
- Pro
Starred repositories
A Rust library and command-line tool for analyzing Power-Law distributions in empirical data.
This repo is the official megengine implementation of the ECCV2022 paper: Efficient One Pass Self-distillation with Zipf's Label Smoothing.
The official Zig programming language logo & copyright information
A Git-compatible VCS that is both simple and powerful
Research code for the paper "How Good is Your Tokenizer? On the Monolingual Performance of Multilingual Language Models"
🌿 An easy-to-use Japanese Text Processing tool, which makes it possible to switch tokenizers with small changes of code.
Zen is a general purpose programming language designed to build simple, reliable and efficient programs.
🎒 Token-Oriented Object Notation (TOON) – Compact, human-readable, schema-aware JSON for LLM prompts. Spec, benchmarks, TypeScript SDK.
😎 Awesome list of Retrieval-Augmented Generation (RAG) applications in Generative AI.
分享一些好用的 Dify DSL 工作流程,自用、学习两相宜。 Sharing some Dify workflows.
ColBERT: state-of-the-art neural search (SIGIR'20, TACL'21, NeurIPS'21, NAACL'22, CIKM'22, ACL'23, EMNLP'23)
An implementation of a small TCP/IP protocol stack for learning.
Programs to examine Zipf's meaning-frequency law through contextual diversity
🐳 Efficient Triton implementations for "Native Sparse Attention: Hardware-Aligned and Natively Trainable Sparse Attention"