CalTRACK - Project Updates

OpenEEmeter Technical Working Group Meeting Summary | March 5, 2024

3/27/2024

Thanks to everyone who joined our recent OpenEEmeter Technical Working Group meeting on March 5th, 2023.

Travis Sikes kicked off the meeting with an announcement of RetroMeter's use case review of OpenEEmeter 4.0 on Thursday, March 14th at 10am CST, in which they presented some of the work they've been doing adapting the OpenEEmeter for use cased in the U.K., and giving the OpenEEmeter developer community an opportunity to provide feedback on the user experience with the new API and desired features.

Travis then announced the full public release of OpenEEmeter 4.0, now available via pip install. You can learn more about OpenEEmeter 4.0 from the recent the Linux Foundation Energy webinar. The discussion then moved on to recent work on the hourly model. In the previous meeting, Armin Aligholian presented results showing the elastic net model outperforming XGBoost, AdaBoost and other regression models usable within scikit-learn in terms of test error, computation time, and reduced overfitting. The elastic net had lower error on cloudy days and lower bias.

In this meeting, Armin described how the team explored using an LSTM neural network architecture. While this approach showed some promise, the LSTM model was very computationally expensive, taking 14 minutes per meter on a CPU to achieve test error comparable to the elastic net.

The elastic net model is 11x faster than the current OpenEEmeter model, with lower test error and less overfitting. The team also looked at incorporating supplemental data like EV charging and pump schedules. Adding this binary time series data as an input feature improved predictions of energy spikes by 40% in a worst-case scenario.

Some key next steps are migrating the new elastic net model into the OpenEEmeter API, exploring adding NMBE to the loss function, analyzing performance on commercial buildings, and revisiting data sufficiency criteria in light of the new model structure. While the new architecture allows for easy incorporation of additional time series inputs, the group will need to be thoughtful about which inputs to allow in the base model to ensure quality and standards.

Thanks again to Travis and Armin for leading the group through the latest results and analyses, and to everyone for the great questions and discussion.

Next Meeting Scheduled: Tuesday, April 2, 2024

Watch the full presentation below.

Announcing OpenEEmeter 4.0!

3/18/2024

Thanks to everyone in the working group for all of your hard work and input on developing the LF Energy OpenEEmeter version 4.0.

LF Energy OpenEEmeter measures the energy impacts of demand-side interventions in buildings. OpenEEmeter 4.0 provides enhanced performance of the daily model with dramatically reduced seasonal and weekend/weekday bias, along with increased computational efficiency.

Among other benefits, OpenEEmeter 4.0:

Reduces seasonal bias in the daily model by 84%
Reduces weekend/weekday bias in the daily model by 95%
Runs up to 100x faster with monthly data, and 2 - 10x faster with daily data

Along with this release, the OpenEEMeter community is also publishing a detailed 4.0 model specification and results of thorough testing conducted across residential and commercial sectors and gas and electric fuels.

On March 12, 2024, LF Energy hosted a webinar which explained in detail how OpenEEmeter 4.0 was developed and why these advances are important for measuring the energy impacts of demand-side interventions in buildings.

Watch the full webinar below:

OpenEEmeter Technical Working Group Meeting Summary | February 6, 2024

2/27/2024

Travis led off the meeting with an explanation of why the group is integrating the written methods into the OpenEEmeter repository. The group then discussed the many functions (such as weather data) that are duplicated between the EEweather and OpenEEmeter and the advantage of integrating them, including eliminating redundancies and making the process of updating much simpler while providing a more seamless experience for users. Travis then explained a number of simplifications that have been made in OpenEEmeter 4.0 code, specifically including default data settings in the baseline class. As most users use these defaults, this will simplify the use of the software and offer more consistency and standardization, while still allowing more expert users to change defaults for their particular use cases. Travis also proposed some adjustments to data sufficiency requirements, such as removing the requirement that daily data for electric meters not have negative values, as this doesn't necessarily indicate an error (there are many solar users, for example, who may show have negative consumption on regular basis). Instead, a requirement would be added that non-electric meters cannot have negative values.

Armin recapped the discussion from last week of the advantage of switching from CVRMSE to PNRMSE as a more reliable model performance measure, especially for solar customers. Armin then explained how the working group has been testing different models with combined data from different weather features (solar irradiance, humidity, and temperature) to determine which perform the best. The team found that an elastic net model performs better than the current hourly model and better than other models tested, including for computational speed.

For future work, the team will continue to focus on the challenge of overfitting, load shape analysis for different seasons, and considering the potential of ensemble models.

Next Meeting Scheduled: Tuesday, March 5, 2024

Watch the full presentation below.

OpenEEmeter Technical Working Group Meeting Summary | December 5, 2023

1/2/2024

Thanks to everyone who joined the most recent OpenEEmeter working group.

Travis Sikes led off this meeting with a recap of the last meeting, in which the goal was to explore how to incorporate a variety of additional data inputs into the OpenEEmeter, such as temperature, humidity, and especially solar irradiance, in addition to contextual time series (day of week and month) data.

Travis pointed out that initially the team was using a stratified K-fold scheme within baseline period for cross validation, but has moved on from that due to a concern of information leak; instead, they are now using a rolling test/train approach to minimize model overfitting.

Travis then reviewed the previous discussion in which the group had discussed the need to move away from CVRMSC (Coefficient of Variation of Root Mean Squared Error) as metric for calibrating models which doesn't work well for buildings with solar panels. The group discussed instead using PNRMSE (Percentile Normalized Root Mean Squared Error), which appears to correlate well with CVRMSC.

Armin Aligholian then went into more detail on the switch from stratified sampling to the three years rolling test/train framework. He went on to explain how the team was exploring the addition of GHI (solar irradiance) and its impact on model performance, specifically for solar customers. Moreover, CCI (cloud cover index) was used as a metric to analyze the importance of GHI specifically on more cloudy days.

The meeting ended with a discussion of the need for more models in future work, including more work on neural network models, more input variation, as well as looking more closely at the impacts of cloud coverage, larger datasets for population analysis, and other factors.

Next Meeting Scheduled: Tuesday, February 6th, 2023.

Watch the full presentation below.

OpenEEmeter Technical Working Group Meeting Summary | November 7, 2023

11/27/2023

Thanks to everyone who attended the most recent OpenEEmeter working group meeting.

The meeting began with a discussion by Jason Chulock of coming improvements in the OpenEEmeter 4.0 API, including consolidating usage between all three methods — hourly, daily, and billing — and making certain common configurations the default. Jason then laid out improvements around data sufficiency and methods compliance. The goal of these changes is to make the API more user-friendly and efficient.

Travis Sikes then led a recap and discussion of the current issues and progress on the CalTRACK 2.0 model. Key concerns of CalTRACK 2.0 include its tendency to be overfit, its incompleteness for solar PV customers, and the inflexibility in handling input data.

Travis explained that the team would be using AMI measurements combined with weather, solar, and categorical data to enhance prediction accuracy. He then discussed evolving the cross-validation methodology from a static 24-hour window to a dynamic rolling test/train approach. There was a consensus on the need for a more robust error metric, suggesting a shift from CVRMSE to PNRMSE. Travis emphasized the need for commercial data to complete the test data sets.

Looking ahead, next steps include the continued exploration of advanced modeling techniques like neural networks and the use of larger datasets for a more thorough population analysis.

Next Meeting Scheduled: Tuesday December 5th, 2023.

Watch the full presentation below.

OpenEEmeter Technical Working Group Meeting Summary | October 3, 2023

10/19/2023

The working group is pleased to announce that they have wrapped up the OpenEEmeter 4.0 Daily Model, and that the Alpha version is now available to the public and can be installed using: pip install eemeter==4.0.0a2.

The discussion then moved to progress on the hourly model. Armin began by reviewing some of the issues with the current hourly model, including that it seems to be overfit, and that it is incomplete when it comes to solar photovoltaic customers, where it produces high error rates. The question is how to make the model flexible and allow more inputs, including solar data.

Armin then discussed two broad options for addressing this issue: disaggregating solar data, and including weather and solar data to improve AMI prediction. Solar disaggregation would be more complex, which suggests that improving AMI prediction might be preferable for now.

The discussion then moved to the question of what approach to take when adding complexity to the model. There was discussion of the pros and cons of various machine learning approaches, including elastic nets and neural nets, with elastic nets as a simpler and more "interpretable" option.

There was also discussion in the group about the potential downside of neural nets and losing the interpretability of the model; however it was pointed out when you add a lot of coefficients, even in linear model, you lose physical interpretability.

The discussion ended with next steps, including implementing rolling train/test cross validation, using larger datasets for population analysis, doing a deep dive on the impact of GHI in model improvement, and developing neural net models for AMI prediction.

Next Meeting Scheduled: Tuesday November 7th, 2023.

Watch the full presentation below.

OpenEEmeter Technical Working Group Meeting Summary | September 5, 2023

9/14/2023

Thanks to everyone who joined the last OpenEEmeter working group.

Adam Scheer kicked off the meeting with the announcement that the name CalTRACK was being deprecated; going forward, both methods and code will be referred to as the OpenEEmeter under a single umbrella to emphasize the tool's global relevance. This decision was influenced by feedback from clients and users that the name CalTRACK gave an impression of regional specificity, leading to concerns about the model's applicability outside California.

In addition, while it is appropriate to continue to have a detailed description of methods, the OpenEEmeter code is now sophisticated enough that it should be considered the source of truth as to what is actually happening with model calculations.

Adam then announced that the 2.1 daily model is at the point where it will soon be ready to be merged and available to everyone. In addition, the team will soon release comprehensive R&D results to support the decisions made in the final formulation of the model.

Next, Adam gave a high level overview of the 2.0 hourly methods to set the stage for discussing improvements in the performance of the 3.0 methods. Adam explained that 2.0 is founded on a Time of Week and Temperature (TOWT) model, which is primarily based on two variables: the hour of the week and the temperature. With 168 hours in a week, this results in a unique energy consumption prediction for each hour. To capture seasonal variations, the model is designed to create an independent prediction for each month of the year, considering the unique energy consumption characteristics of every month.

Issues with the 2.0 model include potential overfitting and that the model is incomplete when it comes to solar PV customers, whose consumption is heavily dependent on the amount of sunlight (solar irradiance). Without an awareness of solar irradiance the model will perform poorly when predicting consumption patterns of solar customers.

The goal of 3.0, is to reduce over- or under-fitting, introduce solar irradiance and other weather variables that may have an impact on consumption, and allow the model to take advantage of the patterns that it recognizes in the data.

The group then discussed in detail various aspects of these challenges and potential solutions. The meeting concluded with a discussion of laying the groundwork for next steps.

Next Meeting Scheduled: Tuesday, October 3rd, 1pm PT.

Watch the full presentation below.

August 1, 2023 | OpenEEmeter Technical Working Group Meeting

8/3/2023

Thanks to everyone who joined us for this week's OpenEEmeter working group.

Adam Scheer led off this week with a discussion of wrapping up the CalTRAck 2.1 daily model and beginning work updating the hourly model in preparation for CalTRACK 3.0. CalTRACK 2.1 makes significant improvements on accuracy and computational efficiency over CalTRACK 2.0 daily (speeds are up to 100 times faster). This is a huge improvement and will make the methods evne more valuable across a variety of use cases. Next steps for CalTRACK 2.1 include releasing detailed model specifications and updating the OpenEEmeter to incorporate CalTRACK 2.1.

Adam then explained how the CalTRACK 2.0 hourly model works, what some of its limitations are, and why we need more adaptable approaches to address modern demand-side programs including electric vehicles, heat pumps, solar + storage, and other load shifting approaches.

To begin work on the CalTRACK 3.0 hourly methods, the team has initiated a literature review aimed at understanding the current state of the art and the latest technological advancements and modeling techniques that have emerged in the past few years. Armin Aligholian explained some of the approaches explored, including load simulation software, statistical learning, machine learning, and Baysean methods.

Next steps:

Focus on low bias model for AMI baseline prediction
Models should be implementable at scale
Explore and implement optimization and machine learning models
Explore and implement probabilistic models
Explore and implement solar weather data
Combine solar disaggregation with AMI modeling
Focus on individual level solar disaggregation
Explore and implement solar PV physical model

Next Meeting Scheduled: Tuesday, September 5th, 1pm PT.

Watch the full presentation and download the slides below.

meeting_9_lfe_openeemeter_wgmtg_8-1-2023.pdf
File Size:	4056 kb
File Type:	pdf

June 6, 2023 | OpenEEmeter Technical Working Group Meeting

6/12/2023

Thanks to everyone who attended the most recent working group meeting.

Adam Scheer led off the meeting with results of testing on how CalTRACK 2.1 running in "2.0 mode" gives almost identical model results to CalTRACK 2.0, while benefitting from the enormous improvements in computational efficiency of CalTRACK 2.1. This is important, because in many use cases (such as when only monthly data is available), users will be running the methods with the older model; they can now have confidence that results will almost identical. Adam then discussed the testing results for CalTRACK 2.1, noting that while not every problem has been solved, many big-ticket items have been addressed. He mentioned the model's improved reliability and more efficient handling of complex data. He pointed out that CalTRACK 2.1 has improved seasonal and weekend/weekday bias dramatically over CalTRACK 2.0.

Wintertime bias across 4,000 residential gas meters went from -7% to -1%.
Summertime bias went from 11% to 5%
Weekday/weekend bias for electric meters went from -3% to 1%.
For commercial buildings, the weekend/weekday bias went from 14% to under 1%.

Adam and Travis then discussed the introduction of an adaptive loss function, a step beyond using mean squared error. Introducing this function yields similar results to the CalTRACK 2.1 model, but is slightly better for some uses cases and does not introduce significant bias. The discussion moved towards the final steps in the development of CalTRACK 2.1. Adam highlighted the importance of good software hygiene and maintaining updated versions. He also talked about the possibility of incorporating new features, such as more efficient data handling and storage capabilities provided by the new version of Panda's library.

The discussion on the future of the working group centered around what comes next, including the potential development of CalTRACK 3.0 and its associated features, such as improved daily modeling and potential incorporation of thermal lag and other factors into the model. The conversation also included a brief mention of possible code consolidation under the OEEM umbrella. Adam noted the increasing use of OpenEEmeter, indicating that it might be beneficial to bring all these tools and methods together for a more streamlined approach.

The team is aiming to get the updates into the OpenEEmeter to be available for others to test within the month.

Next Meeting Scheduled: Tuesday, July 11th, 1pm PT.

Watch the full presentation below.

Join the Technical Working Group

May 2, 2023 | OpenEEmeter Technical Working Group Meeting

5/4/2023

Thanks to everyone who joined for this month's OpenEEmeter working group. We're excited at the huge progress we're making on CalTRACK 2.1, and greatly appreciate all who have taken time out of their schedule to participate in this important discussion.

Tim Guiterman from Sealed led off yesterday's discussion with a follow-up conversation on revising CalTRACK's requirements to permit its usage with delivered fuels such as propane and heating oil. Because these fuels are delivered and manually refilled, data for them are not as consistent nor as abundant as metered energy is. Tim and the Sealed team proposed making an exception to the 70 day data limit for delivered fuels.

The group discussed what the minimum number of data points should be and various factors that could alter this. There was also discussion of relaxing the 365 day baseline period to allow for more data when fitting this period. Most delivered fuel customers can provide data in the form of bills that cover years of time. Adam pointed out that while there might not be a perfect answer to the delivered fuel problem, using the CalTRACK approach is still much better than deemed approaches which are much less accurate.

The conversation concluded with Tim agreeing to analyze Sealed data so that the working group can make a data-driven decision on if rule changes should be made for delivered fuel customers and if so what should the minimum requirements be, with proof that backs up these revisions.

Adam and Travis then summarized CalTRACK 2.0's deficiencies and how the nearly final model of CalTRACK 2.1 rectifies them. The main problem the team was attempting to solve was seasonal bias in the CalTRACK 2.0 model found primarily in gas meters. They showed that CalTRACK 2.1 has significantly reduced seasonal bias. At the same time it has been necessary to revisit how fast the model can be fit in. Travis has implemented several clever means of speeding up the model from worst-case scenarios of 1 minute to fit down to 10 seconds. He also went into detail about how CalTRACK 2.1 differs from CalTRACK 2.0 in order to solve the issues laid out prior. Additionally, Travis briefly mentions that they have implemented a CalTRACK 2.0 mode, a legacy mode, which can replicate CalTRACK 2.0 results with all of the speed improvements developed recently.

Overall, they show that CalTRACK 2.1 reduces seasonal bias by 74% and is between 2 and 100 times faster than CalTRACK 2.0, depending on how it's used (legacy mode or not).

Adam and Travis anticipate that by the next meeting, the final CalTRACK 2.1 model will have completed its R&D phase. Next steps involve code cleanup, polishing, and documentation so that CalTRACK 2.1 can be integrated into the OpenEEmeter.

Next Meeting Scheduled: Tuesday, June 6th, 1pm PT.

Watch the full presentation below.

Join the Technical Working Group

The purpose of this blog is to provide a high-level overview of CalTrack progress.

For a deeper understanding or to provide input on technical aspects of CalTrack, refer to the GitHub issues page (https://github.com/CalTRACK-2/caltrack/issues).

Recordings

2019 CalTRACK Kick Off:

CalTRACK 2.0
July 19, 2018
June 28, 2018
June 7, 2018
May 24, 2018
May 3, 2018
April 12, 2018
March 29, 2018
March 15, 2018
March 1, 2018
February 15, 2018
February 1, 2018

Archives

March 2024
February 2024
January 2024
November 2023
October 2023
September 2023
August 2023
June 2023
May 2023
April 2023
March 2023
February 2023
January 2023
December 2022
November 2022
July 2019
March 2019
February 2019
August 2018
July 2018
June 2018
May 2018
April 2018
March 2018
February 2018