As hydrological systems grow increasingly complex and data-rich, new tools are needed to support rapid interpretation and communication of climate and water cycle trends. This study evaluates whether instruction-tuned large language models (LLMs) can interpret long-term precipitation records in the context of hydrological variability. Using a 23-year (2002–2024) ERA5 monthly precipitation time series from a location in Greece, we test whether lightweight, open-source models, including TinyLlama (1.1B) and Phi-2 (2.7B), can generate semantically coherent summaries of seasonal dynamics, detect hydrological anomalies, and answer natural language questions relevant to water resource monitoring.
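For context, a single-point monthly precipitation series of this kind can be extracted from an ERA5 download with xarray, as in the minimal sketch below; the file name, coordinates, and unit conversion are illustrative assumptions, not the study's actual configuration.

```python
# Minimal sketch, assuming an ERA5 monthly total-precipitation NetCDF file has been
# downloaded from the Copernicus Climate Data Store; the file name and coordinates
# are hypothetical, chosen only for illustration.
import xarray as xr

ds = xr.open_dataset("era5_monthly_tp.nc")            # hypothetical local download
point = ds["tp"].sel(latitude=38.0, longitude=23.7,   # illustrative location in Greece
                     method="nearest")
series = point.sel(time=slice("2002-01", "2024-12")).to_series()

# ERA5 stores precipitation in metres; the conversion to mm (and any per-day to
# per-month scaling) should be verified against the dataset's metadata.
series_mm = series * 1000.0
series_mm.to_csv("precip_monthly_mm.csv")
```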
The time series is preprocessed into prompt-compatible text blocks, enabling models to produce narrative outputs describing dry and wet seasons, interannual shifts, and extreme events. Responses are evaluated against visual and statistical baselines to assess their hydrological fidelity. Phi-2 demonstrates stronger correlation with observed patterns, while TinyLlama provides fluent but less consistent outputs. All models show limitations in numerical reasoning and require tight prompt structuring to avoid hallucinated values.
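As an illustration of this preprocessing step, the sketch below shows one way a monthly series could be serialized into a compact, prompt-compatible text block and passed to an instruction-tuned model through Hugging Face transformers; the prompt wording, file name, and generation settings are assumptions rather than the study's actual pipeline. A year-per-line layout keeps the full 2002–2024 record within the short context windows of small models such as TinyLlama.

```python
# Illustrative sketch (not the authors' exact prompt): serialize the monthly series
# into a compact year-per-line text block and ask a lightweight instruction-tuned
# model for a narrative seasonal summary.
import pandas as pd
from transformers import pipeline

series_mm = pd.read_csv("precip_monthly_mm.csv",
                        index_col=0, parse_dates=True).squeeze("columns")

# One line per year, twelve rounded monthly totals (Jan-Dec), to keep the prompt short.
rows = []
for year, group in series_mm.groupby(series_mm.index.year):
    values = " ".join(f"{v:.0f}" for v in group.values)
    rows.append(f"{year}: {values}")

prompt = (
    "Monthly precipitation totals in mm (Jan-Dec) for a site in Greece:\n"
    + "\n".join(rows)
    + "\nDescribe the typical wet and dry seasons and flag any anomalous years."
)

# TinyLlama chat model from the Hugging Face Hub; greedy decoding to limit
# run-to-run variation in the generated summary.
generator = pipeline("text-generation", model="TinyLlama/TinyLlama-1.1B-Chat-v1.0")
summary = generator(prompt, max_new_tokens=300, do_sample=False)[0]["generated_text"]
print(summary)
```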
Our findings suggest that even low-resource LLMs can serve as effective interpretive aids in hydrology, particularly for rapid diagnostics, stakeholder reporting, and data contextualization. When paired with physical constraints and structured input formats, LLMs could enhance exploratory analysis and decision-support capabilities in water management, climate services, and early warning systems. A reproducible Google Colab notebook and annotated model comparisons are included to support further hydrological application and refinement.
