mcore_adapter

MCoreAdapter

MCoreAdapter is a lightweight bridge toolkit for scalable LLM/VLM training, combining NVIDIA Megatron-LM's distributed training efficiency with HuggingFace Transformers-like API simplicity.

Developed as Roll Framework's Megatron-LM integration layer, it enables seamless interoperability between Roll's reinforcement learning workflows and Megatron's distributed training capabilities.

Installation

pip install "git+https://github.com/alibaba/roll.git#subdirectory=mcore_adapter"

Usage

Except reinforcement learning with Roll, MCoreAdapter can also be applied for LLMs and VLMs in PreTraining, SFT and DPO/ORPO.

See examples for fine-tuning examples used LLaMA-Factory library.

Convert between HuggingFace and Megatron

Convert a Megatron model to HuggingFace model:

python tools/convert.py --checkpoint_path path_to_megatron_model --output_path path_to_output_hf_model

MCoreAdapter can directly load a HuggingFace model, so you can skip converting the model to Megatron.

Name		Name	Last commit message	Last commit date
parent directory ..
examples		examples
src/mcore_adapter		src/mcore_adapter
tools		tools
Makefile		Makefile
README.md		README.md
requirements.txt		requirements.txt
setup.py		setup.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

README.md

MCoreAdapter

Installation

Usage

Convert between HuggingFace and Megatron

FilesExpand file tree

mcore_adapter

Directory actions

More options

Directory actions

More options

Latest commit

History

mcore_adapter

Folders and files

parent directory

README.md

MCoreAdapter

Installation

Usage

Convert between HuggingFace and Megatron