Official Repository for The Paper, Zero-Effort Image-to-Music Generation: An Interpretable RAG-based VLM Approach

RS2002/Image2Music

Image2Music

Article: Zijian Zhao*, Dian Jin, Zijing Zhou, "Zero-Effort Image-to-Music Generation: An Interpretable RAG-based VLM Approach" (under review)

1. Workflow

2. Dataset

The dataset is amaai-lab/MidiCaps, available on Hugging Face Datasets.

After downloading, rename train.json to meta.txt.
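
The rename step can also be scripted. The following is a minimal sketch, assuming train.json has already been downloaded into a local directory. The `rename_metadata` helper and the `data` directory name are hypothetical and not part of this repository; check main.py for the location where meta.txt is actually expected.

```python
from pathlib import Path


def rename_metadata(data_dir: str, src: str = "train.json", dst: str = "meta.txt") -> Path:
    """Rename the downloaded MidiCaps metadata file (hypothetical helper)."""
    source = Path(data_dir) / src
    target = Path(data_dir) / dst
    if target.exists():
        # Already renamed on a previous run; leave it untouched.
        return target
    if not source.exists():
        raise FileNotFoundError(f"{source} not found; download it from MidiCaps first")
    source.rename(target)
    return target
```

For example, `rename_metadata("data")` would turn data/train.json into data/meta.txt, where `data` is an assumed directory name.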

The data-processing code is based on jwdj/EasyABC (github.com).

3. Run the Model

python main.py

4. Citation

@misc{zhao2025zeroeffortimagetomusicgenerationinterpretable,
      title={Zero-Effort Image-to-Music Generation: An Interpretable RAG-based VLM Approach}, 
      author={Zijian Zhao and Dian Jin and Zijing Zhou},
      year={2025},
      eprint={2509.22378},
      archivePrefix={arXiv},
      primaryClass={cs.SD},
      url={https://arxiv.org/abs/2509.22378}, 
}

5. Links

The following websites provide abc2midi and midi2abc conversion:

midi2abc (marmooo.github.io)

ABC notation converter - Nota ABC (notabc.app)
