LiveEO supercharges their ML infrastructure and accelerates their geospatial workloads practice

By Toby Rahloff, Phi Nguyen and Alex Streed   

LiveEO accelerates and optimizes their geospatial workloads by up to 65% using Anyscale and Prefect Cloud.

LiveEO, a startup based in Germany, focuses on providing actionable insights on earth observation data by processing large-scale satellite-based images for use cases such as infrastructure monitoring for public utilities and transportation.

Electric grid and railway companies can now use LiveEO satellite asset monitoring solutions to provide actionable insights for various solutions such as storm response, vegetation management, and ground deformation.

liveeo-usecases

Asset monitoring solutions:

  • Vegetation Management Insights: Our vegetation management solution detects and assesses trees and provides prioritized risk evaluations based on vegetation condition and distance to your assets.

  • Ground Movement Insights: Our ground deformation solution detects vertical and lateral ground displacements in the millimeter range, identifies trends and alerts you of any risk to your assets.

  • Rapid Response Insights: Our storm response solution pinpoints damaged sites following a storm, regardless of daylight or cloud cover, and notifies field teams in record time.

On any given day, it is not uncommon for LiveEO to process up to two petabytes of data. It was imperative for LiveEO to build a Python-based solution that could scale efficiently in a timely manner.

LinkBefore using Anyscale and Prefect, the LiveEO data science stack was “hand-crafted”

artisanal-data-science

Fig. Artisanal data science requires a lot of different tools and craftsmanship.

LiveEO's data science stack had some limitations:

  1. It consisted of diverse tools, including a Jupyter Notebook, Airflow, and AWS Batch.

  2. There was a heavy reliance on team tribal knowledge with no standardization, leading to inconsistency across the pipelines.

  3. There needed to be a systematic approach to package and release, making reproducibility and consistency challenging.


As a result of LiveEO's artisanal data science practice, each pipeline was unique and considered a work of art, which made it difficult for other teams to reproduce. The pipelines were semi-automated and required hand-holding, and it was challenging to determine the right size of compute resources for each pipeline. 

Furthermore, limited observability made it difficult to monitor the health of the pipelines and identify any issues in a timely manner. This made recovery from failure a time-consuming, painful process.

These limitations hindered the efficiency and scalability of LiveEO's data science operations and slowed their time to market for their solutions and new features.

LinkLiveEO’s machine learning platform north star

liveeo-north-star

Fig. ML factory

LiveEO’s data science team needed to upgrade their ML stack and had a few goals in mind:

  • Data science pipelines needed to be standardized, automated, versioned and reproducible. 

  • Release packages should be consumed via API or a UI and composable. 

  • The developer experience should abstract the infrastructure and allow for the workload to scale for CPU and GPUs seamlessly all in Python. 

Packages and pipeline updates could be automated, tested, and deployed continuously.

infra-abstraction

Fig. Infrastructure abstraction should be in Python and easy to use.

LinkThe Anyscale and Prefect integration

Anyscale provides a modern ML infrastructure stack and a managed service to scale your Ray workloads, while Prefect provides a powerful workflow automation, orchestration, and monitoring service all in an easy native Python environment leveraging Python decorators.

LiveEO can now leverage Anyscale for instant scalability, and cost efficiency using spot instances for their managed Ray clusters while benefiting from Prefect Cloud to scale their operations using the workflow orchestrator and monitoring solutions. The Anyscale-Prefect built-in integration provides a simple “RayTaskRunner” that can be incorporated into a Prefect flow, automatically scaling LiveEO’s workflows

prefect-ray

Fig. RayTaskRunner allows you to submit tasks in parallel.

liveeo-grafana

Fig. Scaling from 0 to 1k cores in about 1 minute.

LiveEO achieved remarkable results through the combined use of Anyscale and Prefect. By leveraging Anyscale's distributed computing capabilities in tandem with Prefect's intuitive workflow automation and orchestration engine, LiveEO reduced runtime by 30% — translating to a staggering 65% reduction in costs. 

Further, LiveEO's data science operations have been transformed. Teams were able to streamline, automate, and standardize their practices. The result? Increased continuous delivery, and faster and more reliable customer service — all significantly boosting LiveEO's bottom line.

For more on these powerful tools, check out the livestream recording and the prefect-ray integration.

Next steps

Anyscale's Platform in your Cloud

Get started today with Anyscale's self-service AI/ML platform:


  • Powerful, unified platform for all your AI jobs from training to inference and fine-tuning
  • Powered by Ray. Built by the Ray creators. Ray is the high-performance technology behind many of the most sophisticated AI projects in the world (OpenAI, Uber, Netflix, Spotify)
  • AI App building and experimentation without the Infra and Ops headaches
  • Multi-cloud and on-prem hybrid support