[WIP] RollingWindow cross-validation #3638

0x0L · 2014-09-04T20:42:53Z

A cross-validation strategy for time series; see http://robjhyndman.com/hyndsight/tscvexample

Initial commit, tests and unfinished docs

I don't really like the name of the class. Hopefully, someone will find a better name for this.

coveralls · 2014-09-04T20:57:17Z

Changes Unknown when pulling c806ebf on x0l:cv_timeseries into * on scikit-learn:master*.

coveralls · 2014-09-04T21:19:53Z

Changes Unknown when pulling 348f114 on x0l:cv_timeseries into * on scikit-learn:master*.

coveralls · 2014-09-04T21:31:26Z

Changes Unknown when pulling 348f114 on x0l:cv_timeseries into * on scikit-learn:master*.

coveralls · 2014-09-04T21:56:41Z

Changes Unknown when pulling c056e12 on x0l:cv_timeseries into * on scikit-learn:master*.

coveralls · 2014-09-04T22:28:26Z

Changes Unknown when pulling 698d723 on x0l:cv_timeseries into * on scikit-learn:master*.

coveralls · 2014-09-06T00:35:08Z

Changes Unknown when pulling 8854b66 on x0l:cv_timeseries into * on scikit-learn:master*.

amueller · 2015-01-13T22:12:15Z

Sorry for the lack of feedback. A lot of the devs are very busy at the moment.
We don't really have any time-series specific algorithms, so this might not be a great fit.

mjbommar · 2015-01-14T03:02:13Z

See also discussion here: #3202

elgehelge · 2016-09-01T14:09:38Z

I think this has been rejected for the wrong reasons.

Having a sequence where order matters should not be confused with having a time series. Often you find yourself with a sequence of data points where the ordering matters, but you know nothing about the absolute time, only the relative order.

Let's say you want to predict future data points based on all previous data points - to validate this correctly you will have to choose a split that preserves order. You will train on the first part, and test on the last part, which you are pretending not to have seen yet. But you might want to calculate the score on more than just a single split.

Anyways, thanks for contributing @0x0L. This class is super helpful!
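The order-preserving, multi-split validation described in this comment can be sketched as follows. This is a minimal illustration in plain Python, not the code from this PR and not scikit-learn's API: the function name `rolling_origin_splits` and its parameters `test_size` and `max_train_size` are hypothetical, chosen only for this example. Each split trains on samples that come strictly before the test samples. Leaving `max_train_size` unset gives an expanding training window, while setting it gives a fixed-size rolling window.

```python
from typing import Iterator, List, Optional, Tuple


def rolling_origin_splits(
    n_samples: int,
    n_splits: int,
    test_size: int = 1,
    max_train_size: Optional[int] = None,
) -> Iterator[Tuple[List[int], List[int]]]:
    """Yield (train, test) index lists that never train on later samples.

    Hypothetical helper for illustration only.
    """
    first_test_start = n_samples - n_splits * test_size
    if first_test_start < 1:
        raise ValueError("not enough samples for the requested splits")
    for test_start in range(first_test_start, n_samples, test_size):
        train_start = 0
        if max_train_size is not None:
            # Fixed-size (rolling) window: drop the oldest samples.
            train_start = max(0, test_start - max_train_size)
        train = list(range(train_start, test_start))
        test = list(range(test_start, test_start + test_size))
        yield train, test


# Expanding window: each split trains on everything seen so far.
expanding = list(rolling_origin_splits(n_samples=6, n_splits=3))

# Rolling window: each split trains on at most the 2 most recent samples.
rolling = list(rolling_origin_splits(n_samples=6, n_splits=3, max_train_size=2))

for train, test in expanding:
    print("train:", train, "test:", test)
```

Averaging a score over these splits, rather than over a single train/test cut, is what gives the "more than just a single split" evaluation mentioned above.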

jnothman · 2016-09-01T14:53:14Z

Interesting. This wasn't rejected at all, though @amueller made a comment that he clearly went back on in #6322. This PR was ignored for no good reason, and the contributor closed it. A TimeSeriesSplit has recently been merged in #6586, but I admit that this implementation has some enviable features. I think we (@yenchenlin?) should look at porting some enhancements from here. And I like the name RollingWindow which may also be a term in the literature.

mjbommar · 2016-09-02T14:04:35Z

This is related to walk-forward optimization/cross-validation, which I had proposed building and Gael had rejected in this issue:
#3202

I spoke privately with @MechCoder on the topic recently but don't recall if we landed on anything specific.

0x0L · 2016-09-02T18:50:55Z

@elgehelge Thanks, that's nice

As I recall, I closed the PR because I thought I would package it and a few other contribs (notably RVMs) in a separate repo, but I never found the time to do so :)

jnothman · 2016-09-03T11:55:56Z

Ah well, sorry to all whose work was not deemed appropriate at the time. I think there's been a recent move to acknowledge the need for CV splitters that accommodate common kinds of non-IID data. If there is a better (i.e. familiar in its research community) name for TimeSeriesSplit, you have a couple of days to propose it! If there are features missing, let's work on it.

amueller · 2016-09-06T19:32:04Z

I think adding splitters that encourage good practices is good. While we don't really support time-series-specific models (and I'd like to keep it that way), I think we should acknowledge that people are using sklearn models for time-series data (a lot!) and we should make it easy for them to do The Right Thing (tm).

0x0L changed the title ~~WIP RollingWindow cross-validation~~ [WIP] RollingWindow cross-validation Sep 4, 2014

RollingWindow cross-validation

c90a335

MechCoder force-pushed the master branch from 6deaea0 to 3f49cee Compare November 3, 2014 12:36

0x0L closed this Jan 13, 2015
