Onzo

Onzo operate an energy analytics platform. This allows utilities to make the most of their customer data.

Onzo

Industry: Energy
Size: SME
Location: London, UK
Website: www.onzo.com

Tech used

Scala (Akka)
Python (Tensorflow)
Kubernetes

Features

Streaming and batch processing
Support for 5 million homes
Advanced machine learning

My role

This was my first project working in the energy analytics sector. I was hired as the lead architect, responsible for the complete platform. The existing platform was unreliable and did not scale well. We decided to build a new platform that is stable, reliable and scalable. The platform needed to support both streaming and batch analytics. I was also partly involved in a funding round. We successfully raised cash from a business partner and two private equity firms.

Major challenges

Firstly we needed to migrate customers from the legacy platform to the new one. This proved to be challenging as our customers were able to make few, if any changes on their side. I didn’t want to carry the legacy of previous design decisions with us. We needed to strike a balance between starting with a clean sheet and not breaking existing deployments. Another challenge was the rapidly changing business environment. The functional requirements were changing on an almost daily basis, requiring a very fast turnaround.

The solution

In the end, I built two “platforms” to support the batch and streaming analytics. I discovered that there was very little overlap in terms of functional or non-functional requirements between the two platforms. The streaming platform run on the Akka stack. I chose core Akka, Streams, Persistence & Clustering. The batch platform is based on Apache Spark. Both systems used Apache Cassandra as their data store. The machine learning elements of the system are built primarily in Python/Tensorflow

Scala
Akka
Python
Google Tensorflow
Kubernetes

What did I learn?

Firstly I learned a lot about the industry and was able to transfer my experience to the eon optimum project.

When the direction of the business is unclear it’s best to get something up and running quickly. We need to accept that software developed this way will have to be rewritten at a later date. This is usually a hard sell to CFOs and investors. However, there is tremendous value in writing “throw away” code. The business can validate which ideas are actually viable. The tech teams can discover the challenges and pitfalls of a particular domain at relatively low cost.

It’s not the first time I’ve discovered this principle. Over the years, I’ve worked on many software projects, I would have to admit that we rarely get it right first time. My time at Onzo really reinforced this belief.

The data scientists need to “own” their models. When I joined Onzo the data scientists created models that were essentially proof of concepts. They would often be developed in Python notebooks. The POCs were then handed to Scala developers who were responsible for turning them into Scala code. This process didn’t work well. We moved to a position in which the data scientists owned the models through to production. This was far more sustainable.

The Results

Scale

5M homes

build for 5 million homes

Funding

3 investors

participated in the funding round

Team size

17 new devs

10 new developers joined us

Customers

5 new

orders from 5 major clients