Events The 1st International Online Conference on Marine Science and Engineering

Event submissions

Published

This submission belongs to the session A. Ocean Engineering of the event The 1st International Online Conference on Marine Science and Engineering

Published date

19 Nov, 2025

Academic Editor

Dong-Sheng Jeng

Citation

Chao Feng Shih, A Review of Current Developments in Generative Artificial Intelligence for Underwater Marine Environments, in Proceedings of The 1st International Online Conference on Marine Science and Engineering, 24 November–26 November 2025, MDPI: Basel, Switzerland

Facebook

Twitter

A Review of Current Developments in Generative Artificial Intelligence for Underwater Marine Environments

Chao Feng Shih ¹

1. Department of Marine Engineering, National Taiwan Ocean University, Keelung 202301, Taiwan, Taiwan

Abstract

This study investigates the application of generative artificial intelligence visual language models for object detection and obstacle recognition in underwater remotely operated vehicles (ROVs). By combining open-source underwater image datasets with images collected by ROVs, we systematically compare the performance of multiple advanced visual language models. The experimental design encompasses three typical underwater scenarios, aquaculture, marine exploration, and environmental monitoring, to evaluate the models' adaptability under varying underwater environmental conditions. We employ four key indicators for quantitative evaluation: accuracy, which reflects the model's ability to minimize false positives; recall, which measures the completeness of its detection of true targets; F1-score, which comprehensively balances the two; and average precision, which assesses the model's positioning accuracy under an overlap threshold of 50%. The results indicate that model performance is significantly influenced by environmental complexity. For instance, in turbid waters, the recall rate of all models decreases by approximately 15%, underscoring the unique challenges presented by underwater scenes. Additionally, we found that the models' ability to recognize small targets is generally inadequate, necessitating further optimization of the feature extraction architecture or the introduction of domain adaptation training in future work.

Keywords

Underwater object detection

visual language model

generative

edge computing.

Poster

IOCMSE_poster_20251024.pdf

Evaluation of Reduction and Validation Strategies in the Prediction of Extreme Ocean Events