Prophet: Observation Weights

Created on 28 Aug 2017 · 10Comments · Source: facebook/prophet

Allow the user to specify observation weights for the past which can be used to discount early periods.

enhancement

Source

seanjtaylor

👍14

Most helpful comment

I think that if you wanted to prototype this in a local fork for your current use case, it actually wouldn't be too involved. You would add the weights into the likelihood in the Stan model here:
https://github.com/facebook/prophet/blob/e41ed25646f44f713c110c30c07c678e4a07728e/python/stan/unix/prophet.stan#L140
(sigma_obs here is the standard deviation, so you would replace that with the standard error, which is sigma_obs / w, where the weight w would be set to sqrt(n)).

You then just need to pass vector[T] w into the Stan model in this block:
https://github.com/facebook/prophet/blob/e41ed25646f44f713c110c30c07c678e4a07728e/python/stan/unix/prophet.stan#L87

and then add it to the Python or R where you pass the data over to Stan (I'll show Py here, but it's parallel for R). You would add something like

'w': np.sqrt(history['n'].values)

to the dictionary here: https://github.com/facebook/prophet/blob/e41ed25646f44f713c110c30c07c678e4a07728e/python/fbprophet/forecaster.py#L1124

You would then install this local version, and then if you include n as a column in the dataframe that you pass into Prophet, it should work.

bletham on 8 Oct 2020

👍4

All 10 comments

Just wondering if this would this also allow for inverse-variance weighting? And if not, is some sort of observational weighting in the works?

mkiang on 19 Sep 2017

Are there any updates on the problem?

zhitkovk on 13 Dec 2017

What is the status of this enhancement?
Its a really useful feature which I would like to see in the package

pkmurthy on 7 Dec 2018

👍1

This would indeed be a very welcome feature. In the meantime, is there any way one could add an additional column ( = an additional regressor) that could simulate this?

adrfantini on 30 Sep 2019

I had the similar problem and got a little better result with additional column/regressor.
(mape 0.274 is reduced to 0.256 in cross validation results)
My input dataframe is created with hourly time series traffic data and I prepared a simple linearly increasing day count weight and applied it as multiplicative regressor as below.
Any comments?

df['recency'] = df.index // 24
...
m.add_regressor('recency', prior_scale=10, mode='multiplicative')
...
future["recency"] = future.index // 24

BryanKoo on 3 Jan 2020

👍3

@bletham How would the authors propose handling periodic data where sample size n varies markedly from period to period? Can I get the desired effect by repeating each weekly observation n times in the input file? will the confidence intervals be correct?

numeric-lee on 7 Oct 2020

@numeric-lee to understand better - you have weekly data, and for each week there are multiple observations. It sounds like these are being averaged? And the sample size is the number of points being included in that average, which is different each week?

Adding each point in n times would be equivalent to weighting the observations appropriately. It might be computationally slow though depending on how big n is.

bletham on 8 Oct 2020

@bletham Thanks, it seems to be an adequate workaround. is there an alternative (a weighting vector) which would be computationally faster that exists today, anything on the horizon?

numeric-lee on 8 Oct 2020

You then just need to pass vector[T] w into the Stan model in this block:
https://github.com/facebook/prophet/blob/e41ed25646f44f713c110c30c07c678e4a07728e/python/stan/unix/prophet.stan#L87

and then add it to the Python or R where you pass the data over to Stan (I'll show Py here, but it's parallel for R). You would add something like