AI Prediction Accuracy Ranking: Where AI Excels and Struggles

2026-03-16

Our site uses a LightGBM machine learning model to predict visibility, but accuracy varies dramatically by site. IOP achieves R²=0.82 (highly accurate), while Yonaguni sits at R²≈0.05 (virtually unpredictable). Let's explore why through data.

0.82

Best R² (IOP)

0.00

Worst R² (Osezaki Bay)

0.824

Global model R²

45

Features

What is R² (coefficient of determination)?

R² ranges from 0 to 1, where 1 means perfect prediction. R²=0.82 means the model explains 82% of visibility variation. R²=0.05 means only 5% — essentially random.

AI Prediction Accuracy by Site

#SiteGradeModelObs
1IOP0.820ExcellentPer-site3,151
2Kannoura0.660GoodPer-site580
3Shirasaki0.603GoodGlobal420
4Hirasawa0.594GoodGlobal2,696
5Futone0.566GoodGlobal780
6Echizen0.455FairPer-site2,652
7Kumomi0.477FairGlobal1,980
8Miyakejima0.434FairPer-site941
9Futo0.430FairGlobal3,493
10Kushimoto0.426FairGlobal3,168
11Kerama0.250PoorGlobal1,533
12Ishigaki0.150PoorGlobal1,473
13Yonaguni0.050UnpredictableGlobal4,826
14Osezaki Bay0.001UnpredictableGlobal1,200

R² evaluated on test data. Global = trained on all sites combined; Per-site = trained on individual site data.

Characteristics of Predictable Sites

Open ocean exposure

Sites like IOP and Futone face open ocean, where weather and sea conditions (waves, swell, wind) directly affect visibility. The rain → river runoff → turbidity causal chain is clear.

Clear seasonal patterns

IOP (winter-high, summer-low) and Echizen (summer peak) have regular monthly patterns that AI learns easily. The month_sin/cos features work well here.

Characteristics of Unpredictable Sites

Current-driven visibility changes

Yonaguni and Ishigaki visibility swings with Kuroshio path changes, but current paths can't be predicted from weather data alone. Adding ocean current features is a future goal.

Enclosed bay topography

Osezaki Bay has R²≈0.00. Enclosed bays are insulated from open ocean conditions, dominated by tidal and subtle topographic factors that our current features can't capture.

Consistently high visibility year-round

Sites like Yonaguni (24.5m annual avg) have minimal variation, making small changes hard to predict. Paradoxically, low prediction accuracy can mean consistently clear water — not necessarily a bad thing.

Practical Implications for Divers

Sites where AI forecast is useful

IOP, Hirasawa, Shirasaki, Kannoura — Izu to Kii Peninsula sites. Check our forecast a few days out, like a weather forecast.

Sites where local intel beats AI

Yonaguni, Ishigaki, Kerama, Osezaki Bay. For these, check the local shop's recent posts or SNS. Our AI forecast is just a rough reference.

Future Accuracy Improvements

  • Adding ocean current features (Kuroshio path/speed) → expected improvement for Yonaguni, Ishigaki
  • Adding tidal data → better predictions for enclosed bay sites
  • Real-time satellite data (chlorophyll, turbidity) → baseline improvement for all sites
  • Continued data collection → more observations means better model accuracy

About the Data

R² values from LightGBM model evaluation on test data (time-series split). Global model R²=0.824 is the baseline. Per-site models trained on sites with 1,000+ observations. Results as of March 2026.

🌊 Check Visibility Forecasts

View AI-powered 7-day visibility forecasts for 30+ dive sites across Japan.

Open Forecast App →