Closed
Labels
Enhancement, Moderate (anything that requires some knowledge of conventions and best practices), module:datasets
Description
We currently store the results of each HTTP request performed by `fetch_openml`. It can still take a substantial amount of time to decode the fetched ARFF data into numpy arrays or sparse matrices. We could instead (or also) cache the decoded data on disk, so that the ARFF does not need to be parsed again on repeated calls.
Alternatively, we can hope that OpenML provides a more compact data representation some time soon, as per openml/OpenML#388.
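
To make the caching proposal above concrete, here is a minimal sketch of an on-disk cache for decoded data. It is not how `fetch_openml` is implemented: `cached_decode`, `toy_decode`, and the cache layout are hypothetical names chosen for illustration, and the toy decoder only stands in for the slow ARFF parsing step. It assumes the decoded result can be pickled; for real numpy arrays or sparse matrices, something like `numpy.save` or `joblib.dump` may be a better fit.

```python
import hashlib
import os
import pickle
import tempfile


def cached_decode(raw_bytes, decode, cache_dir):
    """Return decode(raw_bytes), reusing a pickled result on disk if present.

    Hypothetical helper for illustration, not part of scikit-learn.
    """
    os.makedirs(cache_dir, exist_ok=True)
    # Key the cache on the payload content, so a changed download
    # never returns stale decoded data.
    key = hashlib.sha256(raw_bytes).hexdigest()
    path = os.path.join(cache_dir, key + ".pkl")
    if os.path.exists(path):
        with open(path, "rb") as f:
            return pickle.load(f)
    result = decode(raw_bytes)
    # Write to a temporary file and rename it into place, so a crash
    # mid-write cannot leave a truncated cache entry behind.
    fd, tmp_path = tempfile.mkstemp(dir=cache_dir, suffix=".tmp")
    with os.fdopen(fd, "wb") as f:
        pickle.dump(result, f, protocol=pickle.HIGHEST_PROTOCOL)
    os.replace(tmp_path, path)
    return result


calls = []


def toy_decode(raw_bytes):
    # Stand-in for the expensive ARFF parsing step (assumption).
    calls.append(1)
    lines = raw_bytes.decode().splitlines()
    return [[float(v) for v in line.split(",")] for line in lines]


with tempfile.TemporaryDirectory() as cache_dir:
    payload = b"1,2\n3,4"
    first = cached_decode(payload, toy_decode, cache_dir)
    second = cached_decode(payload, toy_decode, cache_dir)  # served from disk
```

In this sketch the decoder runs only on the first call; the second call loads the pickled result. Keying on a hash of the raw payload is one possible design choice. Keying on the dataset ID and version would avoid hashing large downloads, at the cost of trusting that the remote data never changes under the same version.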