```r
# Load the necessary libraries
library(forecast)
library(MAPA)
library(tsintermittent)

# In-sample data: 'y.trn'
# Out-of-sample data: 'y.tst'
# Forecasts from our new brilliant method are stored in 'mymethod'

# Start timer - just to see how long this takes
tm <- proc.time()

# Let's produce some benchmarks
frc <- array(NA,c(7,24)) # 7 benchmarks, forecast horizon 24

fit.ets <- ets(y.trn)
fit.arima <- auto.arima(y.trn)
fit.mapa <- mapaest(y.trn,paral=1,outplot=FALSE)
# Naive: repeat the last in-sample observation
frc[1,] <- rep(y.trn[length(y.trn)],24)
frc[2,] <- forecast(fit.ets,h=24)$mean
frc[3,] <- forecast(fit.arima,h=24)$mean
frc[4,] <- mapafor(y.trn,fit.mapa,fh=24,ifh=0,outplot=FALSE)$outfor
frc[5,] <- crost(y.trn,h=24)$frc.out
frc[6,] <- crost(y.trn,h=24,type="sba")$frc.out
frc[7,] <- tsb(y.trn,h=24)$frc.out
rownames(frc) <- c("Naive","ETS","ARIMA","MAPA","Croston","SBA","TSB")

# Calculate accuracy: percentage errors for 'mymethod' and the 7 benchmarks
PE <- (matrix(rep(y.tst,8),nrow=8,byrow=TRUE) -
       rbind(mymethod,frc))/matrix(rep(y.tst,8),nrow=8,byrow=TRUE)
MAPE <- rowMeans(abs(PE))*100

# Stop timer
tm <- proc.time() - tm
```

So what we have here are forecasts from ‘mymethod’ and some benchmarks:

Croston’s method: this is intended for intermittent data, but I just wanted to demonstrate how easy it is to use as a benchmark. It comes from the ‘tsintermittent’ package.

SBA (Syntetos-Boylan Approximation): this is a variant of Croston’s method, also from the ‘tsintermittent’ package.

TSB (Teunter-Syntetos-Babai): another intermittent demand method from the ‘tsintermittent’ package.
Of course not all benchmarks are appropriate, but I wanted to demonstrate how easy it is to use them. Accuracy is assessed using the Mean Absolute Percentage Error (MAPE), averaged over the t+1 up to t+24 forecasts.
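As a minimal sketch of this accuracy calculation, the MAPE of several methods can be wrapped in a small helper. The function name `mape` and the toy numbers below are assumptions for illustration only and are not part of the benchmarking code above. Note that MAPE is undefined when any actual value is zero, so the helper checks for that.

```r
# Hypothetical helper: MAPE (in %) for each row of a forecast matrix.
# 'actual'    : numeric vector with the h out-of-sample observations
# 'forecasts' : methods-by-h matrix (one row per method)
mape <- function(actual, forecasts) {
  actual <- as.numeric(actual)
  if (!is.matrix(forecasts)) forecasts <- matrix(forecasts, nrow = 1)
  stopifnot(ncol(forecasts) == length(actual), all(actual != 0))
  # t(forecasts) is h-by-methods, so 'actual' is recycled down each column
  pe <- (actual - t(forecasts)) / actual
  colMeans(abs(pe)) * 100
}

# Toy data (made up for illustration)
actual  <- c(100, 200, 400)
toy.frc <- rbind(A = c(110, 180, 400),
                 B = c(100, 200, 300))
toy.mape <- mape(actual, toy.frc)
print(round(toy.mape, 2))
```

With the objects from the example above, `mape(y.tst, rbind(mymethod, frc))` should give the same numbers as the `MAPE` vector.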
```
> print(round(MAPE,2))
mymethod    Naive      ETS    ARIMA     MAPA  Croston      SBA      TSB
    5.76     7.37     6.00     7.34     5.59     5.76     7.07     7.26
> print(tm)
   user  system elapsed
   4.24    0.00    4.86
```
Apparently mymethod is pretty good (MAPE of 5.76%), but in terms of accuracy MAPA seems to be doing better (5.59%). Perhaps it is interesting to see that Croston’s method is not that bad (5.76%), which makes sense considering that for non-intermittent data it is equivalent to exponential smoothing.
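The equivalence can be illustrated with a rough base R sketch. The functions `ses_sketch` and `croston_sketch` are hypothetical simplifications written for this illustration: they use a fixed smoothing parameter and initialise each level at the first value, whereas `crost()` in ‘tsintermittent’ optimises its parameters, so the numbers will not match that package. The point is only that, when every period has non-zero demand, all inter-demand intervals equal 1 and the Croston forecast collapses to the exponentially smoothed demand.

```r
# Simple exponential smoothing: returns the final level (the flat forecast)
ses_sketch <- function(y, alpha) {
  level <- y[1]                                  # initialise at first value
  for (t in seq_along(y)[-1]) {
    level <- level + alpha * (y[t] - level)
  }
  level
}

# Croston's method: smooth non-zero demand sizes and inter-demand intervals
# separately, then forecast the demand rate as size / interval
croston_sketch <- function(y, alpha) {
  nz        <- which(y > 0)        # periods with non-zero demand
  sizes     <- y[nz]               # non-zero demand sizes
  intervals <- diff(c(0, nz))      # periods between non-zero demands
  ses_sketch(sizes, alpha) / ses_sketch(intervals, alpha)
}

# Non-intermittent toy series (made up): all intervals are 1
y.full <- c(12, 15, 11, 14, 13, 16)
frc.croston <- croston_sketch(y.full, alpha = 0.2)
frc.ses     <- ses_sketch(y.full, alpha = 0.2)
print(c(Croston = frc.croston, SES = frc.ses))

# Intermittent toy series (made up): the two methods now differ
y.int <- c(0, 3, 0, 0, 6)
frc.int <- croston_sketch(y.int, alpha = 0.5)
```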
The whole benchmarking took very little coding and only 4.86 seconds to run. This could be sped up using parallel processing, which both the ‘forecast’ and ‘MAPA’ packages support. Of course, a rolling origin evaluation (cross-validation) would have been preferable to a single forecast origin, and this could be implemented easily with a loop.
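A rolling origin loop might look like the sketch below. To keep it self-contained it makes several assumptions that differ from the example above: it uses the `AirPassengers` series that ships with R instead of ‘referrals’, a horizon of 12, twelve forecast origins, and only two base R benchmarks (naive and seasonal naive). Any of the fitted models above could be slotted into the loop in the same way, by refitting on `train` at each origin.

```r
y <- as.numeric(AirPassengers)   # monthly series from base R (assumption)
h <- 12                          # forecast horizon
m <- 12                          # seasonal period for the seasonal naive

# Forecast origins: the last 12 points that still leave h observations to test on
origins <- seq(length(y) - h - 11, length(y) - h)

mape.ro <- matrix(NA, nrow = length(origins), ncol = 2,
                  dimnames = list(NULL, c("Naive", "SNaive")))

for (i in seq_along(origins)) {
  o     <- origins[i]
  train <- y[1:o]                # in-sample data up to the origin
  test  <- y[(o + 1):(o + h)]    # the next h observations

  frc.naive  <- rep(train[o], h)                       # last observation
  frc.snaive <- rep(tail(train, m), length.out = h)    # last seasonal cycle

  mape.ro[i, "Naive"]  <- mean(abs((test - frc.naive) / test)) * 100
  mape.ro[i, "SNaive"] <- mean(abs((test - frc.snaive) / test)) * 100
}

# Average accuracy across all origins
print(round(colMeans(mape.ro), 2))
```

Averaging over several origins makes the comparison less dependent on one particular test period, at the cost of refitting every model once per origin.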
Here is the series and the various forecasts:
```r
# One colour per method: 'mymethod' plus the 7 benchmarks
cmp <- rainbow(8, start = 0/6, end = 4/6)
ts.plot(y.trn,y.tst,ts(t(rbind(mymethod,frc)),frequency=12,end=end(y.tst)),
        col=c("black","black",cmp))
legend("bottomleft",c("MyMethod",rownames(frc)),col=cmp,lty=1,ncol=2)
```
[Figure: bench.fig1]
The example series is ‘referrals’ from the ‘MAPA’ package. In other posts here you can find more information about the functions in the MAPA and tsintermittent packages.
To conclude: benchmark your forecasts, it is easy and necessary!