Lists (1)
Sort Name ascending (A-Z)
Stars
Qwen3Guard is a multilingual guardrail model series developed by the Qwen team at Alibaba Cloud.
[EMNLP 2025 Oral] MemoryOS is designed to provide a memory operating system for personalized AI agents.
一个面向中文文本纠错任务的综合平台,集学术研究、模型训练、模型评测和推理部署于一体,覆盖拼写纠错与语法纠错两个核心方向。
A python module to repair invalid JSON from LLMs
MuCGEC中文纠错数据集及文本纠错SOTA模型开源;Code & Data for our NAACL 2022 Paper "MuCGEC: a Multi-Reference Multi-Source Evaluation Dataset for Chinese Grammatical Error Correction"
A survey on harmful fine-tuning attack for large language model
A fast, distributed, high performance gradient boosting (GBT, GBDT, GBRT, GBM or MART) framework based on decision tree algorithms, used for ranking, classification and many other machine learning …
A reading list for large models safety, security, and privacy (including Awesome LLM Security, Safety, etc.).
A curated list of safety-related papers, articles, and resources focused on Large Language Models (LLMs). This repository aims to provide researchers, practitioners, and enthusiasts with insights i…
Persuasive Jailbreaker: we can persuade LLMs to jailbreak them!
Tencent Pre-training framework in PyTorch & Pre-trained Model Zoo
Universal and Transferable Attacks on Aligned Language Models
deepspeedai / Megatron-DeepSpeed
Forked from NVIDIA/Megatron-LMOngoing research training transformer language models at scale, including: BERT & GPT-2
主要记录大语言大模型(LLMs) 算法(应用)工程师相关的知识及面试题
Qwen3 is the large language model series developed by Qwen team, Alibaba Cloud.
Open-Sora: Democratizing Efficient Video Production for All
Firefly: 大模型训练工具,支持训练Qwen2.5、Qwen2、Yi1.5、Phi-3、Llama3、Gemma、MiniCPM、Yi、Deepseek、Orion、Xverse、Mixtral-8x7B、Zephyr、Mistral、Baichuan2、Llma2、Llama、Qwen、Baichuan、ChatGLM2、InternLM、Ziya2、Vicuna、Bloom等大模型
Llama3、Llama3.1 中文后训练版仓库 - 微调、魔改版本有趣权重 & 训练、推理、评测、部署教程视频 & 文档。
🔎 Monitor deep learning model training and hardware usage from your mobile phone 📱
🧑🏫 60+ Implementations/tutorials of deep learning papers with side-by-side notes 📝; including transformers (original, xl, switch, feedback, vit, ...), optimizers (adam, adabelief, sophia, ...), ga…
A comprehensive, unified and modular event extraction toolkit.
Train transformer language models with reinforcement learning.
Awesome Deep Learning papers for industrial Search, Recommendation and Advertisement. They focus on Embedding, Matching, Pre-Ranking, Ranking (CTR/CVR prediction), Post Ranking, Relevance, LLM, Rei…
苏州大学研究生毕业论文Latex模板 - Overleaf
516PAI / SignificanceTest
Forked from JinJackson/SignificanceTestStatistical Significance Test (统计显著性检验) & Practical Significance Test (现实显著性检验)
本项目旨在分享大模型相关技术原理以及实战经验(大模型工程化、大模型应用落地)
ChatGLM3 series: Open Bilingual Chat LLMs | 开源双语对话语言模型
This is the repository for paper "CREATOR: Tool Creation for Disentangling Abstract and Concrete Reasoning of Large Language Models"