<?xml version="1.0" encoding="utf-8"?><article><front><Journal-meta><journal-id journal-id-type='publisher'>CWE/1779/2026</journal-id><journal-title >Current World Environment</journal-title><issn pub-type='PPub'>0973-4929</issn><issn pub-type='ePub'>2320-8031</issn><publisher><publisher-name>4</publisher-name></publisher></Journal-meta><article-meta><article-id pub-id-type='other'>CWE--100-00</article-id><title-group><article-title>&lt;p&gt;Daily Rainfall Forecasting Across Different Divisions of Assam Using Gradient Boosting with Physically Informed Feature Engineering&lt;/p&gt;</article-title></title-group><contrib-group><contrib contrib-type='author'><name><surname></surname><given-names></given-names></name><xref ref-type='aff' rid='aff00'><sup></sup></xref></contrib><contrib contrib-type='author'><name><surname></surname><given-names></given-names></name><xref ref-type='aff' rid='aff00'><sup></sup></xref></contrib></contrib-group><aff id='aff001'><sup>1</sup><instname></instname>,<deptname>Department of Statistics</deptname>, <instaddress>Cotton University</instaddress>, <instcity>Guwahati</instcity>, <instcountry>India</instcountry>.</aff><pub-date pub-type='ppub'><publicationDate></publicationDate></pub-date><doi>10.12944/CWE.21.1.20</doi><volume>Volume 21</volume><issue>Volume 21</issue><page>287-323</page><abstract><title>Abstract</title><p>Daily rainfall forecasting in Assam, northeast India, remains challenging due to strong monsoon variability and complex terrain. This study compares Multiple Linear Regression (baseline), XGBoost, and LightGBM for daily precipitation prediction across five hydro-climatic regions of Assam using a 24-year meteorological dataset (2001–2024) from IMD (India Meteorological Department) and NASA POWER. Models were trained on 2001–2023 and evaluated on an independent 2024 test year. Both gradient boosting models substantially improved prediction accuracy relative to linear regression, achieving R² (Coefficient of Determination) values of 0.775–0.974 and reducing mean absolute error by up to 85%. Event-detection skill was also strong, with CSI (Critical Success Index) values of 0.852–0.958 at the 5 mm/day rainfall threshold and consistent detection of heavy rainfall events at 25 mm/day, where linear regression showed very limited skill. Seasonal analysis indicated higher uncertainty during the peak monsoon season, particularly in the orographically complex Upper Assam region where LightGBM monsoon R² declined to 0.687. The analysis is limited by the use of single representative grid points per region and evaluation on a single test year, which may not fully capture inter-annual variability. Nevertheless, the results suggest that gradient boosting models with temporal feature engineering and strict train–test separation provide a promising framework for operational rainfall forecasting in monsoon-dominated regions such as Assam.</p></abstract><kwd-group><title>Keywords</title><kwd>Feature Engineering</kwd><kwd> LightGBM</kwd><kwd> Multiple Regression</kwd><kwd> XGBoost</kwd></kwd-group><counts><ref-count count='' /><page-count count='' /></counts></article-meta></front></article>