=Paper=
{{Paper
|id=Vol-1815/paper14
|storemode=property
|title=Case Representation and Adaptation for Short-Term Load Forecasting at a Container Terminal
|pdfUrl=https://ceur-ws.org/Vol-1815/paper14.pdf
|volume=Vol-1815
|authors=Norman Ihle
|dblpUrl=https://dblp.org/rec/conf/iccbr/Ihle16
}}
==Case Representation and Adaptation for Short-Term Load Forecasting at a Container Terminal==
142 Case Representation and Adaptation for Short-Term Load Forecasting at a Container Terminal Norman Ihle OFFIS, Institute for Information Technology, Oldenburg, Germany norman.ihle@offis.de Abstract. The electricity consumption of a terminal is mainly related to the number of container movements and the weather of each day. With the introduction of electric mobility for heavy duty container carriers at a seaport container terminal short-term load forecasting gains an impor- tant part in the procurement process. This paper describes a case-based approach to the forecasting of the electricity consumption time-series of the following day based on historical consumption load curves. It mainly focuses on the case representation which is based on a daily view on the so-called sailing list that is used to plan terminal operations and the adaptation processes that are applied to the time-series after case retrieval. The evaluation of the approach shows some promising first re- sults. 1 Introduction The introduction of electric mobility in industrial enterprises increases the im- portance of energy procurement even in sections where energy procurement was not in focus until today. Especially when some processes are changed from con- ventional to electric mobility in the division of logistics, economic potentials arise through the use of flexible electricity supply contracts. One example of such a logistic system is a seaport container terminal. Tasks like the transport of containers from the quay crane to the bulk storage area can be automated and electrified. This can be achieved by using battery-electric powered engines in the heavy-duty vehicles instead of diesel engines. In opposite to Diesel, that can easily be stored on terminal grounds, the fluctuating demand of the fuel ’electricity’ has to be procured on a short-term basis. When using the electricity exchange, the day-ahead market is the last chance to match the procured elec- tricity load curve with the latest demand forecast values. The forecast of a daily load curve, a time-series with 15-minute time stamps, is referred to as short-term load forecasting (STLF). Over time many methods for short-term load forecasting have been developed and are now in use. Some of those methods are developed for specific scenar- ios, others follow a more general approach. Regarding the application of these methods to the load forecast of a container terminal there is no experience doc- umented in the scientific literature. Even though the more general methods can Copyright © 2016 for this paper by its authors. Copying permitted for private and academic purposes. In Proceedings of the ICCBR 2016 Workshops. Atlanta, Georgia, United States of America 143 be applied to all kind of energy consumers, there is no systematic evaluation which method yields the best results when being applied to a single industrial site from the logistic domain. Since the consumption of electricity at a container terminal is subject to some special features that might not be available for other load forecasting scenarios, e.g. the consumption relies highly on the number of containers being handled, a solution based on Case-Based Reasoning (CBR) is proposed. The idea is that days with similar logistic operation requirements have similar load consumption patterns. This is due to the fact that only a few han- dling equipments have a great impact on the load profile, especially the quay and yard cranes and the reefer container. In the following a case-based load curve forecasting approach is presented that is based on the list of expected ship arrivals and departures. In chapter 2 the case representation is introduced with a focus on how the data for exactly one day is transformed and represented. Chapter 3 describes shortly how time values are handled in regard to similarity. If a similar case is retrieved the corresponding consumption time-series can be adapted to fit it to the forecasting situation. Two approaches for the adaptation are introduced in chapter 4. Chapter 5 then presents the first results of the evaluation before chapter 6 discusses some related work. Chapter 7 closes the paper with the conclusion and an outlook on further work. 2 Case Representation In seaport container terminals the number of logistic operations to be planned is very high. Containers are continuously delivered and picked up via train, truck or ship, the quay and yard cranes have to be assigned accordingly and the storage areas have to be administrated. The most important source for the operation planning is the list of ship arrivals and departures, the so called sail- ing list. The list is a table that includes the most relevant information for the expected ship arrivals and departures. The list is continuously updated with the newest available information that the terminal receives from the ship operators or freight companies. For example, a ship reports its departure from the previ- ous terminal and the expected travel time to the next terminal. Due to weather influences on the traveling speed and other circumstances this information can vary throughout the route. One day before a ship arrives the expected arrival time is pretty reliable. Figure 1 shows some examples from the sailing list of the Container-Terminal Altenwerder (CTA) in Hamburg (Germany) from Septem- ber 2013. The data from the sailing list can be used to build up the case base while the time-series data of the electricity consumption is stored in a dedicated database outside of the case structure due to performance and efficiency reasons. For simplification reasons one row of the sailing list will be referred to as arrival in the following. 144 Fig. 1. Example of sailing list data 2.1 Using Sailing List Data for Case Modeling In a first step data from the columns of the sailing list can be used for case modeling using an attribute-value representation. Besides the expected arrival and departure times of each ship, the sailing list includes information about how many containers have to be loaded and unloaded from each ship during the berthing time. These data together with the ship type can then be used to find similar days in the past. Similar in this case means that ships with a comparable number of containers to be handled have been arriving or departing at almost the same times of the day. The information is adopted for case modeling: – JSNR: Attribute CSJSNR – Ship type: Attribute CSType – Expected arrival time: Attribute CSArrival – Expected departure time: Attribute CSDeparture – Number of containers to unload: Attribute CImport – Number of containers to load: Attribute CExport The JSNR is a unique identifier of each ship’s berthing time. The ship type distinguishes between a barge, a feeder ship and different seagoing vessels, often represented by the code of the regular tour they are operating on. The expected arrival and the expected departure times describe the expected berthing time, which is stored, expressed in minutes, in an additional attribute (CSBerthing ). 2.2 Splitting the Berthing Time to Represent One Day Some entries in the sailing list represent a berthing that includes two or more dates according to the calendar, e.g. the ship APL VANDA with JSNR 307896 in Figure 1. Since the time span for the electricity consumption forecast is one day it is necessary to build up arrivals that represent the information for exactly one day from 00:00 to 23:59 o’clock. The available information has to be adapted to fit into a daily view of the sailing list information. To build up the daily view a berthing that exceeds the date limit can be split up to fit into multiple (virtual) arrivals, each representing the data for exactly one date. The information for JSNR 307896 can be split up to the following three entries: 145 – Arrival 06.09.2013 17:55 - Departure 06.09.2013 23:59 – Arrival 07.09.2013 00:00 - Departure 07.09.2013 23:59 – Arrival 08.09.2013 00:00 - Departure 08.09.2013 14:00 The number of containers to be handled per ship have to split up as well to fit the new virtual arrival information. For the split up, knowledge about the crane assignment at the seaside of the terminal can be used since the container handling is not constant during the berthing time. Using the simulation model presented in [2] and [3] it can be shown that the container handling numbers for large ships (more than 250 containers to be handled) decrease over the berthing time. After some preparation time the handling number starts at a high rate that is decreasing over time because cranes get allocated to other quay areas due to little space at the quay or higher priority of other arriving ships. To approximate this observation, a weighting function is introduced to calculate the number of containers C that is handled at each full hour hi of the berthing time. Full hours are chosen here to approximate the distribution of the handling numbers over the time because more exact information is not available and a break-down to 15 minute values has no further advantage. Based on the average linear handling rate that is needed to load and unload all designated containers Ctotal of the ship during all full hours of the berthing time hn it weights the first hours high and decreases linear over time: 0, 6 Ctotal Chi = (1, 3 − ∗ (hi − 1)) ∗ with hi = 1, ..., hn (1) hn − 1 hn Based on this function the container numbers of a sailing list entry can be split into handling numbers for each virtual arrival and departure. If a ship arrives at the exact beginning of an hour, the first full hour reserved for handling preparations is not regarded for htotal and gets assigned a container handling number of 0. This is also applied if the ship arrives within a full hour. 2.3 Adding Additional Daily Information Having adjusted the sailing list so that all available data can be assigned to exactly one day, daily cases can be built up. For each daily case additional infor- mation can be added. The values for the average temperature and wind speed of each day are of special interest because it can be shown that particularly the temperature value has an impact on the electricity consumption. Besides the weather information the weekday is added to the case since different repeat- ing consumption patterns can be observed in the electricity consumption load curves of different days. While the differences between the weekdays are not as remarkable, differences between load curves of weekdays and days on the week- end differ even though the container terminal realizes a 24/7 operation. This might be explained by the lesser use of the office building and the Sundays driv- ing prohibition for trucks in Germany. A special kind of weekday are holidays at which the terminal is actually shut down as well as the day before and the day after such a holiday. Figure 2 shows some typical load curves on different 146 Fig. 2. Example of typical load curves days. Two attributes with daily information can be derived from the sailing list information: the number of arrivals for a day and the overall number of con- tainers that have to be handled. The second value is a good indicator for the average electricity consumption level for that day. The more containers have to be handled the higher the average level. Additionally an attribute saves the date value. It has the function to link to the corresponding time-series data of the day. 3 Similarity for Time-Based Attributes When comparing two cases the arrival and the departure time of each ship plays an important role to assess the similarity. In the sailing list the information is available in a date-time format, the date being separated from the time by a blank. Since the single cases represent a daily view on the terminal operations, the date information can be disregarded: only the time information is interesting for similarity calculations. In order to be able to use simple existing similarity measures the time value is converted to an integer value. This value represents the number of minutes that have passed since the start of the day. The time value 00:00 therefore is represented by 0 and the end of the day at 23:59 by the value of 1439. If the absolute distance between the time values of two different arrival attributes is less than 60 both arrivals were not more than an hour apart at the point in time of the day and are therefore quite similar. The similarity decreases fast if the difference increases. To represent this, a sigmoid similarity function based on the distance is chosen as suggested by [1]: 1 simCSArrival (dCSArrival ) = d−120 (2) e 30 +1 The same function can be used for the departure time as well and with slightly different parameters it can also be used for the attribute CSBerthing . 147 4 Adaptation The adaptation of the electricity consumption time-series related to similar cases is twofold. Both approaches are based on data stored in the cases. The first adaptation approach focuses on the single values of the time series and tries to adapt the values of each hour of one day based on the difference in the number of container to be handled in each hour. The second one is based on the differences in the weather data and is applied to the general level of all time series values for one day. 4.1 Hourly Container Number Comparison It can be shown that the number of containers handled in an hour has a remark- able impact on the characteristics of the electricity consumption time series. The higher the number of containers handled the higher the electricity consumption. The idea behind the first adaptation approach is to find hourly differences in the number of containers handled on the day of the most similar case and the day of the query. For this purpose the aggregated container moves per hour i are calculated. For each arrival that is related to the day of CASEsim the container number is split into hourly values using the weighting function discussed in chap- ter 2.2. In this context not the exact daily values of the virtual arrivals are used, but the original entries are used in order to be able to use the weighting function properly and the hourly values are not stored. The same can be done for each arrival related to the day of the query (QU ERY ). Having done so, the difference di between the container handling numbers of CASEsim and QU ERY can be calculated for each hour. The first approach was to use this hourly difference directly to apply an adaptation factor to the time series values of that specific hour, but the evaluation showed that this approach had the shortcoming that it does not take into account the operations before and after the hour, but they also have an influence on the electricity consumption of that hour since prepara- tions or post-processing also have to be executed. It might also happen that the difference in the container handling number is subject to strong fluctuations, for example when large seagoing vessels with a huge amount of containers arrive and depart, but the fluctuation in the electricity consumption is not as high at that point of time because of smoothing effects that cause some delay in the change of consumption pattern. For this reason not only difference di in the hour i is regarded when choosing an adaptation factor, but also difference in the hour before i − 1 and the difference in the hour after i + 1 are also considered. At the moment an adaptation factor is assigned to different times of day depending on the discussed differences using a rule set. Table 1 shows an extract of the rules as they are applied for the hour 8 (08:00 - 08:59 o’clock) of the time series. The F actor is added to the time series values of the hour. It can be shown that the fluctuations are higher in the beginning of the day, so the adaptation factors are higher than the ones for morning and evening hours, while the adaptation factors for the hours in the middle of the day are rather low. 148 Hour i di di−1 di+1 F actor 8 > 200 > 150 > 150 0.4 8 > 150 > 120 > 120 0.3 8 > 100 > 70 > 70 0.2 8 > 50 > 20 > 20 0.1 8 < -50 < -20 < -20 -0.1 8 < -100 < -70 < -70 -0.2 8 < -150 < -120 < -120 -0.3 8 < -200 < -150 < -150 -0.4 Table 1. Adaptation factors for hour 8 of a day 4.2 Regarding Seasonal and Weather Influences The second adaptation approach is based on the available weather information. The highest influence on the electricity consumption is based on the air temper- ature. It can be observed that the energy consumption of the container terminal is on average significantly higher in winter than it is in summer. This is mostly due to additional heating that is required when having low temperatures and more lighting is needed on winter days than in summer days. Since the available temperature values are average temperatures for one day, the differences in the temperatures can be applied to increase or decrease the level of the electricity consumption, but they are only supposed to be applied when the case represents a winter day and the query a summer day or the other way around and the temperature difference is greater than 5 degrees Celsius, since a difference of 5 degrees on summer days does not show a significant effect on the electricity con- sumption. Additionally boundaries for the adaptation were defined. This means that if a consumption value is already very high and the temperature based adaptation would raise this value beyond the upper boundary, the adaptation is not applied to avoid a forecast of values that are usually not reached under normal operating conditions of the terminal. The same applies to very low values and the lower boundary. 5 Evaluation For the evaluation the sailing lists and the corresponding electricity consump- tion time-series of the years 2010 through 2013 of the Container-Terminal Al- tenwerder are used. Today, forecasts are generated by using the last week’s values as forecast values (last week approach). While these values fit in some cases, they also generate some remarkable deviation from the forecast to the metered con- sumption in other cases. So besides improving the forecasting accuracy, it is also a goal to decrease the maximum deviations in the forecast compared to the real consumption. For the CBR approach the adapted electricity consumption val- ues of the most similar case are used as forecast values. Besides this forecast a second one based on CBR is calculated by using the mean value of the adapted 149 time-series values of the three most similar cases instead of only the one of the most similar case. This forecast will be referred to as CBR Top 3 approach. The first results of the evaluation show that the CBR approach can be success- fully applied for forecasting the energy consumption of a container terminal. When forecasting the electricity consumption load curve for every day in June 2013, the last week approach yields an average Mean Absolute Percentage Error (MAPE) of 12.0 while the CBR approach yields to 10.0 in average. Having a yearly electricity consumption of more than 65000 MWh, a 2% better forecast avoids penalty fees that have to be paid when the procured electricity load curve does not fit the actual metered one can be avoided to a great extent. With the CBR Top 3 approach the results even yield to a mean of 8.6 which means a further improvement on the forecast. Figure 3 shows the MAPE values for every Fig. 3. MAPE values for June 2013 single day in June 2013 for the three approaches. It can be seen that on some days the MAPE values of the three approaches are close to each other while on some days the last week approach has very high values. On the 4th and the 8th of June the mean absolute percentage error of the last week approach is even higher than 30% while with the CBR Top 3 approach the highest values are slightly above 15%. This shows that the fluctuations can be decreased and the risks of over- or under-procurement of the electricity for a day is minimized. The Mean Squared Error (MSE) values of each day in June confirm these findings that can be observed when forecasting the following month as well. Over the year 2013 the monthly average MSE of the CBR Top 3 approach is never higher than 0.07 while for the last week approach the maximum average value is double that value. The average MAPE for the year 2013 is 8,87% for the CBR Top 3 approach, while the currently used last-week approach has a MAPE of 12,32% in average. 150 6 Related Work In the energy consumption forecasting domain only a few works have been pub- lished that present CBR approaches. [6] present the system FUTURA to fore- cast the medium-term load consumption in Peru. To adjust a calculated start- ing solution, the CBR module uses the average of the retrieved similar cases to smooth out fluctuations of the starting solution. The similarity is based on time attributes like month, weekday, and the time of day. Weather influences are disregarded. This adjusted solution is then adapted based with the expert knowledge regarding the long-term trends in environmental development. It is stated that the expert knowledge can change the load curve up to 20% depend- ing on the influencing parameters. [7] presents an approach for the forecast of power peaks in a distribution grid based on CBR methods. As attributes the author uses weekdays, weekday-types and several weather indicators. The case base is clustered with methods of Self-Organizing Maps (SOM) to allow for a highly efficient retrieval. The k-nearest neighbor retrieval uses a weighted sum of all attributes of the request and a cluster. The most similar case is then re- trieved out of the cluster with the highest similarity value. The power peak of this most similar day is then taken as power peak for the forecast day adapted by a factor that represents the yearly growth rate in the overall load of the grid. In a case study it is shown that the suggested solution outperforms simple last week value approaches and even artificial neural networks. [4] introduce a system for very short-term load forecasting of office buildings based on CBR-methods. The similarity is assessed with a focus on weather indicators like temperature, moisture, etc. The weights of the single attributes are dynamically adjusted re- garding the current season and current weather conditions. Using a subset of the most similar solutions a forecast for the next three hours is built up. It in- cludes a prognosis whether a new power peak will occur in this time horizon. The evaluation of the system shows that it can be applied successfully. The mean squared percentage error is stated to be around 12 to 14 percent. [5] extend this approach to a forecast horizon of 6 hours and present an on-line implementation of the system. They also specify the input attributes that are used. Besides the weather indicators and the current electricity consumption they also use power values of air conditioning and heaters currently in use and room temperatures currently metered within the office building. None of the related works deal with the daily short-term load forecasting of single industrial consumers based on operation plans. 7 Conclusion and Outlook This paper presented an approach to use CBR methods to realize a system for short-term load forecasting for a single industrial customer, a container terminal, and focused on the case modeling and the adaptation of the time-series. The forecast of the electricity consumption for every quarter of an hour of the next day is based on the logistic operation plan that is set up for the next day. This 151 plan includes data about expected ship arrival and departure times as well as information about the number of containers to be handled with each ship. This information can be used to find days in the past with a similar operation plan. The electricity consumption time-series of a similar day can than be adapted based on differences in the operations of each day and based on weather data. The approach has some advantages when compared to other established load forecasting methods: it is generally applicable to maritime container terminals, the computational cost is rather low and the results are comprehensible. The evaluations shows that the approach can be applied successfully. Further improvements might be reached by integrating information about the container numbers that are delivered and picked up by truck or train to approximate the number of container handles and their course more exactly. At the moment this number is estimated because of lack of the data. For the adaptation process a yearly factor can be introduced that will represent the yearly average change in electricity consumption that can be observed over the previous five years. This is also important because in general the case-based forecasting approach tends to underestimate the actual values when applied at the use-case CTA, where the average yearly electricity consumption constantly increased during the last years. It is to be investigated if a CBR-STLF-approach for single electricity consumers based on operation plans can be generalized to further domains than container terminals. Logistic systems can be in focus here as well as more different systems like industrial production sites. References 1. Bergmann, R.: Experience Management: Foundations, Development Methodology, and Internet-based Applications. Springer-Verlag, Berlin, Heidelberg (2002) 2. Grundmeier, N., Hahn, A., Ihle, N., Runge, S., Meyer-Barlag, C.: A simulation based approach to forecast a demand load curve for a container terminal using battery powered vehicles. In: Neural Networks (IJCNN), 2014 International Joint Conference on. pp. 1711–1718. IEEE (2014) 3. Grundmeier, N., Ihle, N., Hahn, A.: A discrete event-driven simulation model to analyse power consumption processes in container terminals. In: Simulation in Pro- duction and Logistics 2015. Fraunhofer IRB Verlag, Stuttgart (2015) 4. Monfet, D., Arkhipova, E., Choiniere, D.: Evaluation of a case-based reasoning energy prediction tool for commercial buildings. International Conference for En- hanced Building Operations (2013) 5. Platon, R., Dehkordi, V., Martel, J.: Hourly prediction of a building’s electricity consumption using case-based reasoning, artificial neural networks and principal component analysis. Energy and Buildings 92, 10 – 18 (2015) 6. Vilcahuaman, R., Melendez, J.: Improvement of electric load forecasting reusing past instances and contextual based adaptation. In: Transmission and Distribution Conference and Exposition: Latin America, 2004 IEEE/PES. pp. 271–276 (Nov 2004) 7. Wang, Z.Y.: Developed case-based reasoning system for short-term load forecasting. In: 2006 IEEE Power Engineering Society General Meeting. pp. 6 pp.– (2006)