Stars
《Designing Data-Intensive Application》DDIA 第一版 / 第二版 中文翻译
The Site Reliability Workbook 站点可靠性工作手册 中文版
AcadHomepage: A Modern and Responsive Academic Personal Homepage
CodeRAG is an AI-powered tool for real-time codebase querying and augmentation using OpenAI and vector search.
LlamaIndex is the leading framework for building LLM-powered agents over your data.
A package to build progressive web apps with Go programming language and WebAssembly.
Analyze the unstructured data with Towhee, such as reverse image search, reverse video search, audio classification, question and answer systems, molecular search, etc.
《代码随想录》LeetCode 刷题攻略:200道经典题目刷题顺序,共60w字的详细图解,视频难点剖析,50余张思维导图,支持C++,Java,Python,Go,JavaScript等多语言版本,从此算法学习不再迷茫!🔥🔥 来看看,你会发现相见恨晚!🚀
扫清go语言一切障碍,go语言实战、go语言从入门到精通,持续更新,欢迎star
⏩ Ship faster with Continuous AI. Build and run custom agents across your IDE, terminal, and CI
Implement a ChatGPT-like LLM in PyTorch from scratch, step by step
Course to get into Large Language Models (LLMs) with roadmaps and Colab notebooks.
📖 The user manual for GitHub Copilot Workspace
Dataflow-guided retrieval augmentation for repository-level code completion, ACL 2024 (main)
Hierarchical Context Pruning (HCP): A strategy to optimize real-world code completion with repository-level pre-trained code large language models
[TMLR] A curated list of language modeling researches for code (and other software engineering activities), plus related datasets.
TypeScript grammar for tree-sitter
这个项目是一个对 GitHub Copilot 进行逆向分析的工作。作者通过分析 Copilot 的 sourcemap 文件,成功还原了 Copilot 的部分源码结构和实现细节。
此项目是机器学习(Machine Learning)、深度学习(Deep Learning)、NLP面试中常考到的知识点和代码实现,也是作为一个算法工程师必会的理论基础知识。
NLP 领域常见任务的实现,包括新词发现、以及基于pytorch的词向量、中文文本分类、实体识别、摘要文本生成、句子相似度判断、三元组抽取、预训练模型等。
Retrieval and Retrieval-augmented LLMs
Langchain-Chatchat(原Langchain-ChatGLM)基于 Langchain 与 ChatGLM, Qwen 与 Llama 等语言模型的 RAG 与 Agent 应用 | Langchain-Chatchat (formerly langchain-ChatGLM), local knowledge based LLM (like ChatGLM, Qwen and…
✨✨Latest Advances on Multimodal Large Language Models