Thanks to visit codestin.com
Credit goes to github.com

Skip to content

lingo-iitgn/awesome-code-mixing

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

88 Commits
 
 
 
 

Repository files navigation

Awesome Code-Mixing & Code-Switching

Awesome PRs Welcome

A curated list of papers, datasets, and toolkits for Code-Switching & Code-Mixing in Natural Language Processing.

Table of Contents

Click on any link to jump to the corresponding section on this page.


Survey Papers

Comprehensive reviews of the code-switching research landscape. A great place to start.


1. NLP Tasks

1.1. Natural Language Understanding (NLU) Tasks

Tasks focused on understanding, parsing, and extracting meaning from code-mixed text.

Language Identification (LID)

Part-of-Speech (POS) Tagging

Named Entity Recognition (NER)

Sentiment & Emotion Analysis

Syntactic Analysis

Intent Classification

Question Answering (QA)

Natural Language Inference (NLI)


1.2. Natural Language Generation (NLG) Tasks

Tasks focused on generating fluent and coherent code-mixed text.

Code-Mixed Text Generation

Machine Translation (MT)

Cross-lingual Transfer

Text Summarization

Dialogue Generation

Transliteration


2. Datasets & Resources

Corpora, toolkits, and frameworks to support your research.

Datasets

Frameworks & Toolkits


3. Model Training & Adaptation

Techniques for building and adapting models to understand and generate code-mixed language.

Pre-training Approaches

Fine-tuning Approaches

Post-training Approaches


4. Evaluation & Benchmarking

Resources for evaluating model performance on code-switching tasks.

Benchmarks

Evaluation Metrics


5. Multi & Cross-Modal Applications

Applying code-switching NLP to speech, vision, and other modalities.

Speech Processing

Vision-Language & Document Processing

Cross-Modal Integration


Workshops & Shared Tasks

A list of academic workshops and community shared tasks dedicated to code-switching.


Contributing

Your contributions are always welcome and make this community resource better!

If you have a paper, dataset, or tool you'd like to add:

  1. Fork the repository.
  2. Add your resource to the relevant section.
  3. Please try to follow the existing format and include a direct link.
  4. Submit a pull request!

About

A curated list of resources dedicated to Code-mixed Natural Language Processing (NLP).

Topics

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Contributors 2

  •  
  •