How AI Assesses Floating Solar Sites Without Survey

Aerial view of a floating solar installation, assessed and screened without a single site visit.

Floating solar projects start with the same problem: before committing survey budgets, a developer needs to know whether a reservoir is worth pursuing. That means depth data, water quality history, and surface conditions, three separate checks, each one traditionally requiring a site visit. This project set out to answer all three using satellite imagery and AI-based analysis.

Executive Summary

Floating solar is growing fast, but early-stage feasibility is slow and expensive. Before committing survey budgets, developers need to know whether a reservoir has the right depth profile, stable water quality, and workable surface conditions. Getting those answers traditionally means site visits, manual depth measurement, and separate environmental checks for every candidate site in every geography.

Omdena built an AI pipeline that screens floating solar sites using freely available Sentinel-2 satellite imagery. The system removes cloud cover from imagery, detects algal bloom risk across the water surface, and predicts water depth at metre-level resolution, producing structured feasibility data without a physical survey at the screening stage.

On Swanson Reservoir, the pipeline predicted water depth within 25 centimetres of measured values. The Random Forest model also achieved an ROC AUC of 0.95 on depth suitability classification. Data that would previously require field teams can now be generated on a desktop, reducing early-stage assessment time and costs before a single survey is commissioned.

KEY FINDINGS
MAE ~0.25m	depth prediction accuracy on Swanson Reservoir using Random Forest across all 13 Sentinel-2 spectral bands
ROC AUC 0.95	depth suitability classification accuracy of the Random Forest model, correctly classifying water zones by depth suitability on Swanson Reservoir
No on-site survey required during early-stage screening	screening-stage feasibility assessment generated entirely from freely available Sentinel-2 satellite imagery at 10-metre resolution
S2 cloudless library	automated cloud masking, removing affected pixels before any spectral analysis, enabling reliable assessment in high cloud-cover regions
MODIS MYDOCGA V6	algal bloom detection using ocean reflectance bands 15, 14, and 13, sensitive to chlorophyll concentration across the reservoir surface
Random Forest selected	outperformed all alternative models tested on depth prediction; uses all 13 Sentinel-2 bands as input for maximum spectral information

Why Floating Solar Sites Are Hard to Screen

The business case for floating solar is well established: reduced land use, lower evaporation, and higher panel efficiency from water cooling. The bottleneck is pre-project screening. Developers evaluating a reservoir shortlist need answers to three questions before committing capital. Is the water deep enough to anchor floating structures safely? Is water quality stable enough to avoid long-term biofouling and maintenance risk? What does the surface look like across seasons and weather conditions?

Through conventional methods, each answer requires a separate process: on-site depth surveys, water sampling, and repeat site visits across seasons. For international portfolios or large candidate lists, that approach does not scale. A developer assessing twelve reservoirs across three countries cannot afford field visits to all of them before knowing which are worth pursuing.

The cost and time consumed screening sites that never make it past feasibility is a real drag on development pipelines. The opportunity is to compress that screening step by answering the baseline questions faster, earlier, and without sending teams to the site.

The Data: Why Satellite Imagery Works

Sentinel-2 satellites capture multispectral imagery at 10-metre resolution on a regular revisit cycle, covering reservoirs globally at no cost. Each image contains 13 spectral bands that carry information about water surface conditions invisible to the naked eye. Depth, turbidity, biological activity, and surface quality all leave detectable signatures in the spectral data. But raw imagery is not analysis-ready: cloud cover and atmospheric interference mean preparing the data is as important as the modelling itself.

The pipeline draws on two satellite sources, each chosen for what it does best. Sentinel-2 provides the spectral resolution needed for depth modelling, with 13 bands updated on a five-day revisit cycle. MODIS is optimised specifically for water-body surface analysis, with its MYDOCGA V6 product designed to detect biological activity in water. Using each source for its intended purpose produces more reliable results than a single-source approach.

Preparing the Data: Cloud Removal and Water Quality

The first step is removing cloud-covered pixels from each Sentinel-2 image. Using the S2 cloudless library, the pipeline detects and masks cloud-affected areas, leaving only clean surface data for analysis. For reservoirs in tropical or high-altitude regions with frequent cloud cover, this step determines whether useful imagery exists at all for a given time window. Without it, large portions of a candidate site could be invisible to the model during key assessment periods.

Figure 1: Cloud cover removal on Utah Lake: before (left) and after (right) applying the S2 cloudless pipeline. Cloud-masked pixels are removed to leave clean surface data for spectral analysis.

With clean imagery available, the pipeline assesses water quality using MODIS MYDOCGA V6 satellite data, specifically bands 15, 14, and 13, which are sensitive to chlorophyll concentration. The model classifies each surface zone as bloom-present or clean, producing a spatial map of algal bloom risk across the reservoir.

For a developer, that map answers a concrete question: does this reservoir have chronic bloom conditions that would affect panel performance and ongoing maintenance costs? Sites with persistent bloom risk can be flagged or deprioritised before any team visits the location, saving both travel budget and decision-making time.

Figure 2: Algal bloom detection on Lake Erie using MODIS MYDOCGA V6 imagery. The system classifies bloom-present and clean zones across the reservoir surface, producing a spatial water quality risk map.

Predicting Water Depth

Water depth is the critical structural variable for floating solar, determining whether anchoring systems are viable and what infrastructure is required. The team tested multiple model types on this prediction task. Random Forest, using all 13 Sentinel-2 spectral bands as inputs, consistently outperformed alternatives and was selected for the final pipeline.

Before modelling, the team ran correlation analysis across spectral bands, applied PCA to identify the most depth-relevant band combinations, and removed depth readings beyond 25 metres, which fall outside the practical range for floating solar infrastructure. That preparation step is what makes the model reliable on real-world reservoirs rather than clean test data. It also keeps the model focused on the depth range that actually matters for infrastructure decisions.

Swanson Reservoir was used as the primary test case, a real working reservoir with measured bathymetry data available for validation. The model’s outputs were compared directly against those measurements to produce the accuracy figures reported in the Results section.

Figure 3: Random Forest depth prediction on Swanson Reservoir, with predicted water depth values mapped against measured bathymetry data used for model validation.

Figure 4: Model performance comparison: Random Forest outperformed all alternative approaches tested on depth prediction accuracy across the Sentinel-2 spectral dataset.

What the Pipeline Delivers

The three pipeline components run in sequence on each target reservoir: cloud removal, water quality classification, and depth prediction. Together, they produce the core inputs a developer needs for an early-stage feasibility decision: a clean view of the water surface, a bloom risk map, and a depth profile, all from satellite data without a site visit.

INPUT: Sentinel-2 Satellite Imagery + MODIS Data (free, global coverage)
↓	↓	↓
STAGE 1	STAGE 2	STAGE 3
Cloud Removal S2 cloudless library: masks cloud-affected pixels across all imagery	Algal Bloom Detection MODIS MYDOCGA V6 bands 15/14/13: classifies water quality zones	Depth Prediction Random Forest across all 13 Sentinel-2 bands: MAE ~0.25m
	↓
OUTPUT: Depth Map + Bloom Risk Map → Feasibility Assessment Data

The pipeline is built as modular Python components covering data access, preprocessing, and modelling, meaning it can be applied to new reservoirs without rebuilding from scratch. It draws on the Bathybase Dataset and Global Reservoir Dataset for bathymetric reference values, providing a validated baseline for depth calibration on new sites.

One current constraint is worth stating directly: the depth model is calibrated per lake rather than universally. A developer assessing a new reservoir runs the model fresh for that site. For portfolio screening, that means a calibration step per site. That is not a blocker for using the tool, but a factor in planning how it fits into the workflow.

The practical shift is when fieldwork happens. Site visits, if they happen at all, come after satellite-based screening has already filtered the candidate list, answered the baseline questions, and identified which sites merit closer investigation. The cost of assessing a site that does not proceed is reduced to the cost of running the pipeline, not the cost of sending a survey team.

What the Model Achieved

On Swanson Reservoir, the Random Forest model predicted water depth with a mean absolute error of approximately 25 centimetres across the full depth range relevant to floating solar infrastructure. That level of accuracy is sufficient to make go/no-go decisions on anchoring feasibility without commissioning a physical bathymetric survey.

The same Random Forest model achieved an ROC AUC of 0.95 for depth-suitability classification, correctly classifying water zones by depth suitability in 95% of cases. Both metrics were produced from satellite data alone, with no field measurement required at the screening stage.

Random Forest outperformed all alternative models tested on this task. Several candidate approaches failed on the dataset; others showed overfitting that did not hold under real-world conditions. The final model’s robustness came in part from the data preparation steps, including removing readings beyond 25 metres and applying PCA to isolate the most depth-relevant spectral bands before training.

How This Works in Practice

A developer with twelve candidate reservoirs across three countries runs satellite-based assessment across the full list before any survey budgets are committed. Depth profiles, water quality flags, and seasonal surface conditions are compared across all sites. Those that fail basic viability criteria are eliminated in the first week. Survey teams go only to sites that have already passed remote screening, meaning every field visit starts with validated data rather than open questions.

An existing floating solar operator uses the cloud-removal and algal-bloom layers to track reservoir conditions year-round. When bloom activity spikes in late summer, a predictable seasonal pattern on many reservoirs, the team schedules panel cleaning before performance drops rather than after. Maintenance windows are planned, not reactive. The same satellite data that informed the initial site decision continues to generate value throughout the operational lifecycle.

The Numbers

Metric	Result
Depth prediction error	MAE ~0.25m
Depth suitability classification	ROC AUC 0.95
Satellite data source	Sentinel-2 (free, global coverage)
On-site survey required	Not at the screening stage

About the Project

Omdena led this project on AI-powered floating solar site assessment for Glint Solar, running a satellite data pipeline that covered cloud removal, algal bloom detection, and water depth prediction from Sentinel-2 imagery. The modular pipeline was delivered with documentation and datasets, ready for integration into floating solar feasibility workflows.

This project is one of over 500 Omdena has delivered across AI, machine learning, and computer vision, spanning solar energy, climate, agriculture, healthcare, and infrastructure. Each project follows the same structure: cross-functional engineering teams, parallel workstreams, and delivery on a fixed timeline.

AI-powered feasibility assessment is now a practical capability for floating solar development teams. Omdena delivers that capability at around a third of what enterprise AI development typically costs, through a proprietary agentic AI platform and a structured delivery process.

To discuss how this could apply to your floating solar pipeline, reach out to Omdena.

FAQ

Does this replace on-site surveys entirely?

Not for final engineering design, but it replaces surveys at the screening stage, where most of the wasted spend occurs. Sites that do not pass remote assessment never incur a survey cost, and those that do proceed to fieldwork arrive with baseline questions already answered.

What types of reservoirs does this work on?

Any reservoir with Sentinel-2 coverage, which is global. The depth model is calibrated per site, so new reservoirs require a calibration run before full prediction can be performed. The cloud removal and algal bloom detection components work out of the box across all geographies.

How long does an assessment take?

Significantly faster than physical surveys. Sentinel-2 imagery is available on demand, and the pipeline processes it in a computational manner rather than through field teams. Calibration time per new site is the main variable; once calibrated, the pipeline runs continuously.

What does Omdena deliver?

A working pipeline configured for the client's target reservoirs, with outputs structured for direct use in feasibility decision-making: depth maps, bloom risk maps, and the supporting data analysis that underpins both. Delivery includes documentation, saved models, and the modular codebase, so the client's team can run it on new sites independently after handover.

Share this article

Share on LinkedIn, send by email, or copy the direct link.

LinkedIn Email

Floating Solar Site Assessment with AI: How Satellite Data Replaces Early-Stage Surveys — Omdena Case Study