Thanks to visit codestin.com
Credit goes to github.com

Skip to content
Change the repository type filter

All

    Repositories list

    • rasa

      Public
      💬 Open source machine learning framework to automate text- and voice-based conversations: NLU, dialogue management, connect to Slack, Facebook, and more - Create chatbots and voice assistants
      Python
      4.9k000Updated Mar 22, 2022Mar 22, 2022
    • OpenNRE

      Public
      An Open-Source Package for Neural Relation Extraction (NRE)
      Python
      1.1k000Updated Dec 9, 2021Dec 9, 2021
    • trankit

      Public
      Trankit is a Light-Weight Transformer-based Python Toolkit for Multilingual Natural Language Processing
      Python
      104000Updated Apr 28, 2021Apr 28, 2021
    • pkuseg多领域中文分词工具; The pkuseg toolkit for multi-domain Chinese word segmentation
      Python
      984000Updated Jan 10, 2021Jan 10, 2021
    • Sentence Embeddings with BERT & XLNet
      Python
      2.7k000Updated Nov 30, 2020Nov 30, 2020
    • MiNLP

      Public
      XiaoMi Natural Language Processing Toolkits
      Python
      90000Updated Nov 18, 2020Nov 18, 2020
    • Ongoing research training transformer language models at scale, including: BERT & GPT-2
      Python
      3.2k000Updated Nov 18, 2020Nov 18, 2020
    • Chinese Pre-Trained Language Models (CPM-LM) Version-I
      Python
      211000Updated Nov 17, 2020Nov 17, 2020
    • UER-py

      Public
      Open Source Pre-training Model Framework in PyTorch & Pre-trained Model Zoo
      Python
      526000Updated Nov 11, 2020Nov 11, 2020
    • CLUE

      Public
      中文语言理解基准测评 Chinese Language Understanding Evaluation Benchmark: datasets, baselines, pre-trained models, corpus and leaderboard
      Python
      547000Updated Nov 6, 2020Nov 6, 2020
    • gpt2-ml

      Public
      GPT2 for Multiple Languages, including pretrained models. GPT2 多语言支持, 15亿参数中文预训练模型
      Python
      333000Updated Oct 27, 2020Oct 27, 2020
    • seq2seq

      Public
      A general-purpose encoder-decoder framework for Tensorflow
      Python
      1.3k000Updated Oct 15, 2020Oct 15, 2020
    • code for ACL 2020 paper: FLAT: Chinese NER Using Flat-Lattice Transformer
      Python
      171000Updated Sep 25, 2020Sep 25, 2020
    • texar

      Public
      Toolkit for Machine Learning, Natural Language Processing, and Text Generation, in TensorFlow. This is part of the CASL project: http://casl-project.ai/
      Python
      369000Updated Sep 17, 2020Sep 17, 2020
    • 100+ Chinese Word Vectors 上百种预训练中文词向量
      Python
      2.3k000Updated Aug 24, 2020Aug 24, 2020
    • minGPT

      Public
      A minimal PyTorch re-implementation of the OpenAI GPT (Generative Pretrained Transformer) training
      Jupyter Notebook
      3k000Updated Aug 18, 2020Aug 18, 2020
    • a bot that generates realistic replies using a combination of pretrained GPT-2 and BERT models
      Jupyter Notebook
      28000Updated Aug 16, 2020Aug 16, 2020
    • Convolutional Neural Network for Text Classification in Tensorflow
      Python
      2.8k000Updated Aug 7, 2020Aug 7, 2020
    • GPT2 for Chinese chitchat/用于中文闲聊的GPT2模型(实现了DialoGPT的MMI思想)
      Python
      677000Updated May 16, 2020May 16, 2020
    • stanza

      Public
      Official Stanford NLP Python Library for Many Human Languages
      Python
      927000Updated May 16, 2020May 16, 2020
    • nltk

      Public
      NLTK Source
      Python
      3k000Updated May 10, 2020May 10, 2020
    • spaCy

      Public
      💫 Industrial-strength Natural Language Processing (NLP) with Python and Cython
      Python
      4.6k000Updated May 7, 2020May 7, 2020
    • Easy to use extractive text summarization with BERT
      Python
      308000Updated Apr 20, 2020Apr 20, 2020
    • A curated list of resources for Chinese NLP 中文自然语言处理相关资料
      1.7k000Updated Apr 13, 2020Apr 13, 2020
    • SpaCy 中文模型 | Models for SpaCy that support Chinese
      Jupyter Notebook
      112000Updated Mar 12, 2020Mar 12, 2020
    • snownlp

      Public
      Python library for processing Chinese text
      Python
      1.4k000Updated Jan 19, 2020Jan 19, 2020
    • albert_zh

      Public
      A LITE BERT FOR SELF-SUPERVISED LEARNING OF LANGUAGE REPRESENTATIONS, 海量中文预训练ALBERT模型
      Python
      750000Updated Jan 7, 2020Jan 7, 2020
    • Pretrained language model and its related optimization techniques developed by Huawei Noah's Ark Lab.
      Python
      642000Updated Dec 7, 2019Dec 7, 2019
    • Chinese version of GPT2 training code, using BERT or BPE tokenizer.
      Python
      1.7k000Updated Nov 4, 2019Nov 4, 2019
    • lingvo

      Public
      Lingvo
      Python
      452000Updated Oct 25, 2019Oct 25, 2019