fetch_openml: perhaps cache decoded ARFF #11821

@jnothman

We currently store the result of each HTTP request performed by fetch_openml. Even so, decoding the fetched ARFF data into numpy arrays or sparse matrices can still take a substantial amount of time. We could instead (or also) cache the decoded data on disk, so that the ARFF does not need to be parsed again on repeated calls.
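
To make the idea concrete, here is a minimal sketch of caching decoded data on disk, keyed by a hash of the raw payload. It is not how fetch_openml works: `load_decoded`, `decode` and `cache_dir` are hypothetical names, with `decode` standing in for whatever ARFF parser is used and `cache_dir` for a directory under the data home.

```python
import hashlib
import pickle
from pathlib import Path


def load_decoded(raw_bytes, decode, cache_dir):
    """Return decode(raw_bytes), reusing a pickled copy on disk when present.

    Hypothetical helper, not part of scikit-learn. `decode` is any callable
    that turns the raw ARFF bytes into Python/numpy objects.
    """
    cache_dir = Path(cache_dir)
    cache_dir.mkdir(parents=True, exist_ok=True)

    # Key the cache on the payload content so a changed file is re-decoded.
    key = hashlib.sha256(raw_bytes).hexdigest()
    cache_path = cache_dir / f"{key}.pkl"

    if cache_path.exists():
        with cache_path.open("rb") as f:
            return pickle.load(f)

    decoded = decode(raw_bytes)

    # Write under a temporary name, then rename, so an interrupted write
    # does not leave a truncated cache file behind.
    tmp_path = cache_path.with_name(cache_path.name + ".tmp")
    with tmp_path.open("wb") as f:
        pickle.dump(decoded, f, protocol=pickle.HIGHEST_PROTOCOL)
    tmp_path.replace(cache_path)
    return decoded
```

With this shape, the second call for the same payload skips `decode` entirely and only pays for unpickling. Pickle files should only be loaded from a directory we control; for large numpy arrays, `joblib.dump` and `joblib.load` would likely be a better fit than plain pickle.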

Alternatively, we can hope that OpenML provides a more compact data representation some time soon, as per openml/OpenML#388.

Metadata

Labels: Enhancement, Moderate (anything that requires some knowledge of conventions and best practices), module:datasets
