Thanks to visit codestin.com
Credit goes to github.com

Skip to content

Question about dataset usage and broken SVGs in SVGX-SFT-1M #23

@e-yi

Description

@e-yi

Hi @ximinng,

Thanks for releasing the SVGX datasets. A few small questions about the datasets:

According to the README:

Available Datasets on Hugging Face:

But I presume SVGX-Core-250k is the actual SFT dataset needed for fine-tuning, based on how it's used in the code? Could you clarify this discrepancy?

Also, could you explain what SVGX_SFT_GEN_51k, SVGX_SFT_GEN_basic, and SVGX_SFT_UN_25k in SVGX-SFT-1M are exactly? I've noticed that a lot of SVGs in SVGX-SFT-1M dataset appear to be broken.

e.g.

Image

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions