Lists (16)
Sort Name ascending (A-Z)
Stars
🌞 CareGPT (关怀GPT)是一个医疗大语言模型,同时它集合了数十个公开可用的医疗微调数据集和开放可用的医疗大语言模型,包含LLM的训练、测评、部署等以促进医疗LLM快速发展。Medical LLM, Open Source Driven for a Healthy Future.
[CBLUE1] 中文医疗信息处理基准CBLUE: A Chinese Biomedical Language Understanding Evaluation Benchmark
bzantium / llm2vec
Forked from McGill-NLP/llm2vecCode for 'LLM2Vec: Large Language Models Are Secretly Powerful Text Encoders'
[Medical_NLP ➟ Awesome-AI4Med] medical-related LLMs, Multimodal systems, Datasets, Benchmarks, and more.
大规模中文自然语言处理语料 Large Scale Chinese Corpus for NLP
Large-scale Pre-training Corpus for Chinese 100G 中文预训练语料
The framework of training large language models,support lora, full parameters fine tune etc, define yaml to start training/fine tune of your defined models, data and methods. Easy define and easy s…
LLM2Vec: Large Language Models Are Secretly Powerful Text Encoders
Code for paper 'Are We Falling in a Middle-Intelligence Trap? An Analysis and Mitigation of the Reversal Curse'
整理开源的中文大语言模型,以规模较小、可私有化部署、训练成本较低的模型为主,包括底座模型,垂直领域微调及应用,数据集与教程等。
中文自然语言推理数据集(A large-scale Chinese Nature language inference and Semantic similarity calculation Dataset)
Repository for "Self-Distillation for Model Stacking Unlocks Cross-Lingual NLU in 200+ Languages"
[EMNLP 2021] SimCSE: Simple Contrastive Learning of Sentence Embeddings https://arxiv.org/abs/2104.08821
Generative Representational Instruction Tuning
Code for 'LLM2Vec: Large Language Models Are Secretly Powerful Text Encoders'
[EMNLP 2024] Knowledge Graph Enhanced Large Language Model Editing
KG-Rank: Enhancing Large Language Models for Medical QA with Knowledge Graphs and Ranking Techniques
Open-source search and retrieval database for AI applications.
李白 👤 作为唐代杰出诗人,其诗歌作品在中国文学史上具有重要地位。近年来,随着数字技术和人工智能的快速发展,传统文化普及推广的形式也面临着创新与变革。国内外对于李白诗歌的研究虽已相当深入,但在数字化、智能化普及方面仍存在不足。因此,本项目旨在通过构建李白知识图谱,结合大模型训练出专业的AI智能体,以生成式对话应用的形式,推动李白文化的普及与推广。