DCP

DCP is a context parallel training library designed for dynamic model input lengths and attention masks. It introduces fine-grained blockwise partitioning of both data and computation, enables flexible mapping of data and computation blocks to any device, and optimizes such mapping through a hypergraph partitioning framework.

For more details on the design, please see our SOSP'25 paper: DCP: Addressing Input Dynamism In Long-Context Training via Dynamic Context Parallelism.

Installation

Using Docker (Recommended)

The easiest way to set up DCP with all dependencies is using Docker.

cd docker

# For standard environments
docker build --build-arg AWS=false -t dcp:latest .

# For AWS environments with EFA network support
docker build --build-arg AWS=true -t dcp:latest .

# Run the container
docker run --gpus all -it dcp:latest

Note: See docker/Dockerfile for the complete installation process used to generate the container for our experiments.

Manual Installation

DCP requires the following dependencies:

Custom PyTorch: A custom branch that supports all_to_all_single with zero-byte send/recv operations

git clone --recursive -b alltoallv https://github.com/chenyu-jiang/pytorch.git
cd pytorch && pip install -e .

Custom FlashAttention: Forked from version 2.6.3, supports specifying attention masks with ranges (limited to at most two ranges per sequence)
```
git clone --recursive -b dcp https://github.com/chenyu-jiang/flash-attention.git
cd flash-attention && pip install -e . --no-build-isolation
```
Note: This installs the custom FlashAttention as dcp_flash_attn to avoid overriding the original FlashAttention package.

DCP Library:

git clone https://github.com/chenyu-jiang/dcp.git
cd dcp && pip install -e . --no-build-isolation

Hypergraph Partitioners: mtkahypar, kahypar, PaToH, and pypatoh

For detailed installation steps, please refer to the docker/Dockerfile.

Quick Start

Below is a pseudo-code example demonstrating how to integrate DCP into a training pipeline. For a complete implementation example, see benchmark/mlm/monkey_patch.py and benchmark/mlm/pretrain_gpt.py, which show how DCP can be integrated with Megatron-LM.

# When defining models
from dcp.runtime.flash_attention.executor import DCPAttention, AttentionExecutor

class TransformerLayer(...):
    def forward(..., dcp_executor):
        ...
        # Replace attention implementation with DCPAttention
        core_attn_out = DCPAttention.apply(dcp_executor, q, kv)
        ...

# Define a mask function
def mask_fn(seqlens, ...):
    ...
    return mask

# In training script
from dcp.data.dataloader import DCPDataLoader

dcp_dataloader = DCPDataLoader(dataset, mask_fn)
# dcp_group is a communicator that connects all devices
# (e.g., torch.distributed.ProcessGroup)
dcp_executor = AttentionExecutor(group=dcp_group)

# Training iterations
for (local_data, execution_plan) in dcp_dataloader:
    # Set execution plan and create buffers
    dcp_executor.prepare(execution_plan)
    # Execute model
    loss = model(local_data, dcp_executor)
    ...

Citation

If you find DCP helpful in your work, we would appreciate a citation to our paper:

@inproceedings{jiang2025dcp,
  author = {Jiang, Chenyu and Cai, Zhenkun and Tian, Ye and Jia, Zhen and Wang, Yida and Wu, Chuan},
  title = {DCP: Addressing Input Dynamism In Long-Context Training via Dynamic Context Parallelism},
  booktitle = {Proceedings of the ACM SIGOPS 31st Symposium on Operating Systems Principles},
  series = {SOSP '25},
  year = {2025},
  pages = {221–236}
}

Artifact Evaluation

Please refer to this document for SOSP'25 Artifact Evaluation.

Name		Name	Last commit message	Last commit date
Latest commit History 14 Commits
benchmark		benchmark
csrc		csrc
dcp		dcp
docker		docker
docs		docs
scripts		scripts
tests		tests
typings		typings
.clang-format		.clang-format
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
pyproject.toml		pyproject.toml
setup.py		setup.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

DCP

Installation

Using Docker (Recommended)

Manual Installation

Quick Start

Citation

Artifact Evaluation

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

DCP

Installation

Using Docker (Recommended)

Manual Installation

Quick Start

Citation

Artifact Evaluation

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages