<?xml version="1.0" encoding="utf-8"?>
<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Publishing DTD v1.0 20120330//EN" "http://jats.nlm.nih.gov/publishing/1.0/JATS-journalpublishing1.dtd">
<article xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink" dtd-version="1.0" article-type="research-article">
  <front>
    <journal-meta>
      <journal-id journal-id-type="nlm-ta">AI Agent</journal-id>
      <journal-id journal-id-type="publisher-id">aiagent</journal-id>
      <journal-title-group>
        <journal-title>AI Agent</journal-title>
      </journal-title-group>
      <issn pub-type="epub">3070-3719</issn>
      <publisher>
        <publisher-name>OAE Publishing Inc.</publisher-name>
      </publisher>
    </journal-meta>
    <article-meta>
      <article-id pub-id-type="doi">10.20517/aiagent.2026.01</article-id>
      <article-id pub-id-type="publisher-id">AIAgent-2026-1</article-id>
      <article-categories>
        <subj-group>
          <subject>Original Article</subject>
        </subj-group>
      </article-categories>
      <title-group>
        <article-title>An energy-efficient scheduling approach for wind-solar-hydrogen systems based on distributed reinforcement learning</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <name>
            <surname>Zhang</surname>
            <given-names>Bo</given-names>
          </name>
          <xref ref-type="aff" rid="I1">
            <sup>1</sup>
          </xref>
        </contrib>
        <contrib contrib-type="author">
          <name>
            <surname>Wang</surname>
            <given-names>Conghao</given-names>
          </name>
          <xref ref-type="aff" rid="I1">
            <sup>1</sup>
          </xref>
        </contrib>
        <contrib contrib-type="author" corresp="yes">
          <name>
            <surname>Ma</surname>
            <given-names>Yan</given-names>
          </name>
          <xref ref-type="aff" rid="I1">
            <sup>1</sup>
          </xref>
          <xref ref-type="corresp" rid="cor1">*</xref>
        </contrib>
        <contrib contrib-type="author">
          <name>
            <surname>Xie</surname>
            <given-names>Jingjing</given-names>
          </name>
          <xref ref-type="aff" rid="I1">
            <sup>1</sup>
          </xref>
        </contrib>
        <contrib contrib-type="author">
          <name>
            <surname>He</surname>
            <given-names>Liang</given-names>
          </name>
          <xref ref-type="aff" rid="I1">
            <sup>1</sup>
          </xref>
        </contrib>
      </contrib-group>
      <aff id="I1000">School of Software, Northwestern Polytechnical University, Xi’an 710072, Shaanxi, China.</aff>
      <author-notes>
        <corresp id="cor1">Correspondence to: Assoc. Prof. Yan Ma, School of Software, Northwestern Polytechnical University, Xi’an 710072, Shaanxi, China. E-mail: <email>yma@nwpu.edu.cn</email></corresp>
        <fn fn-type="other">
          <p><bold>Received:</bold> 22 Jan 2026 | <bold>First Decision:</bold> 20 Mar 2026 | <bold>Revised:</bold> 3 Apr 2026 | <bold>Accepted:</bold> 12 May 2026 | <bold>Published:</bold> 29 May 2026</p>
        </fn>
        <fn fn-type="other">
          <p><bold>Academic Editor:</bold> Hao Li | <bold>Copy Editor:</bold> Xing-Yue Zhang | <bold>Production Editor:</bold> Xing-Yue Zhang</p>
        </fn>
      </author-notes>
      <pub-date pub-type="ppub">
        <year>2026</year>
      </pub-date>
      <pub-date pub-type="epub">
        <day>29</day>
        <month>5</month>
        <year>2026</year>
      </pub-date>
      <volume>2</volume>
	  <issue>2</issue>
      <elocation-id>21</elocation-id>
      <permissions>
        <copyright-statement>© The Author(s) 2026.</copyright-statement>
        <license xlink:href="https://creativecommons.org/licenses/by/4.0/">
          <license-p>© The Author(s) 2026.<bold>Open Access</bold>This article is licensed under a Creative Commons Attribution 4.0 International License (<uri xlink:href="https://creativecommons.org/licenses/by/4.0/">https://creativecommons.org/licenses/by/4.0/</uri>), which permits unrestricted use, sharing, adaptation, distribution and reproduction in any medium or format, for any purpose, even commercially, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made.</license-p>
        </license>
      </permissions>
      <abstract>
        <p>This paper presents a comprehensive energy dispatch strategy based on distributed reinforcement learning to optimize the operation of integrated wind-solar-hydrogen systems. The proposed approach effectively reduces coal fuel costs and carbon emissions while ensuring precise load demand tracking. By implementing a distributed computing framework, the computational challenges associated with training the Deep Deterministic Policy Gradient algorithm on large-scale datasets are effectively addressed. This parallel architecture significantly enhances training efficiency and improves scalability for complex energy management tasks. Additionally, an efficient load pattern identification method, enhanced by Principal Component Analysis and K-means clustering, is developed to capture the salient characteristics of electricity load data. Furthermore, a high-fidelity representative scenario extraction approach, utilizing Dynamic Time Warping and Density-Based Spatial Clustering of Applications with Noise, is proposed to characterize the inherent uncertainties in wind and solar power generation. The integration of hydrogen-based energy storage is proposed as a flexible and sustainable solution to enhance system reliability and mitigate carbon emissions. Empirical simulation results demonstrate that the proposed methodology significantly reduces fuel costs and minimizes carbon emissions while exhibiting improved robustness and computational efficiency. By incorporating hydrogen storage systems and carbon trading mechanisms, the proposed approach optimally facilitates the integration of wind and solar power, thereby providing a comprehensive framework for the efficient operation of hybrid energy systems.</p>
      </abstract>
      <kwd-group>
        <kwd>Carbon trading</kwd>
        <kwd>multi-objective optimization</kwd>
        <kwd>distributed reinforcement learning</kwd>
        <kwd>wind-solar-hydrogen systems</kwd>
      </kwd-group>
    </article-meta>
  </front>
  <body>
    <sec id="sec1">
      <title>INTRODUCTION</title>
      <p>The Paris Agreement establishes a critical global objective to significantly mitigate greenhouse gas emissions, targeting a temperature increase limit of well below 2 °C relative to pre-industrial levels. In alignment with this international framework, numerous nations have formulated Nationally Determined Contributions (NDCs), committing to achieve net-zero emissions by 2050<sup>[<xref ref-type="bibr" rid="B1">1</xref>]</sup>. As a pivotal participant in this global effort, China has pledged a comprehensive transition from fossil fuel-based power generation to renewable energy technologies, aiming for carbon neutrality by 2060<sup>[<xref ref-type="bibr" rid="B2">2</xref>]</sup>. This commitment was formally institutionalized during the United Nations General Assembly, where China unveiled its “3060 Dual Carbon” goal, outlining a strategic 40-year roadmap toward net-zero emissions<sup>[<xref ref-type="bibr" rid="B3">3</xref>]</sup>. Nevertheless, the nation encounters a formidable challenge due to its profound reliance on coal for thermal power generation, which accounts for approximately 40% of its total carbon footprint<sup>[<xref ref-type="bibr" rid="B4">4</xref>-<xref ref-type="bibr" rid="B6">6</xref>]</sup>. Consequently, the modernization of energy infrastructure is imperative for China to fulfill its rigorous emission reduction targets<sup>[<xref ref-type="bibr" rid="B7">7</xref>,<xref ref-type="bibr" rid="B8">8</xref>]</sup>. Driven by the accelerated deployment of renewable resources and advancements in energy efficiency, contemporary research has increasingly focused on the synergistic integration of heterogeneous renewable sources. Extensive studies highlight promising configurations, such as wind-solar hybrid systems<sup>[<xref ref-type="bibr" rid="B9">9</xref>-<xref ref-type="bibr" rid="B12">12</xref>]</sup> and multi-source architectures incorporating wind, solar, thermal, and energy storage units<sup>[<xref ref-type="bibr" rid="B13">13</xref>-<xref ref-type="bibr" rid="B16">16</xref>]</sup>. Among these architectures, wind-solar-hydrogen systems are recognized as essential for low-carbon power generation, offering enhanced reliability and economic efficiency compared to standalone power plants<sup>[<xref ref-type="bibr" rid="B17">17</xref>,<xref ref-type="bibr" rid="B18">18</xref>]</sup>. However, the large-scale integration of wind and solar energy - projected to be China’s dominant electricity sources - requires addressing critical challenges related to inherent intermittency<sup>[<xref ref-type="bibr" rid="B19">19</xref>]</sup>, load fluctuations, and seasonal supply-demand mismatches. To navigate these complexities, the hydrogen energy storage system (HESS) has emerged as a viable solution, facilitating long-duration and seasonal energy balancing within high-renewable grids. Hydrogen storage enhances the temporal flexibility of clean energy utilization and supports decarbonized electricity production while enabling strategic participation in carbon markets. Consequently, an assessment of the spatiotemporal complementarity between wind-solar resources and their alignment with electricity demand is vital for developing high-performance, renewable-dominated power systems. Existing research suggests that the decarbonization of power generation can be effectively pursued through hybrid wind-solar-hydrogen systems, although carbon footprints vary significantly depending on the specific generation strategies employed<sup>[<xref ref-type="bibr" rid="B20">20</xref>-<xref ref-type="bibr" rid="B23">23</xref>]</sup>. This variation underscores the urgent necessity of optimizing CO<sub>2</sub> reduction strategies within integrated wind-solar-hydrogen frameworks. Beyond traditional optimization, autonomous AI agents are emerging as powerful tools for accelerating scientific discovery and conducting end-to-end research tasks in the hydrogen energy sector<sup>[<xref ref-type="bibr" rid="B24">24</xref>]</sup>.</p>
      <p>The intermittent nature of renewable energy sources, specifically wind and solar power, presents significant operational challenges due to their inherent dependence on environmental and climatic conditions. This inherent variability in power generation not only exacerbates the risk of system failures but also introduces substantial uncertainty into energy dispatch modeling<sup>[<xref ref-type="bibr" rid="B25">25</xref>-<xref ref-type="bibr" rid="B27">27</xref>]</sup>. Contemporary energy dispatch approaches are broadly categorized into uncertainty-based and deterministic dispatch models. The former category addresses renewable variability by simulating probabilistic power output scenarios to achieve environmentally differentiated generation through scenario feature identification<sup>[<xref ref-type="bibr" rid="B28">28</xref>]</sup>. Conversely, deterministic models rely on forecasted generation curves for long-term planning while maintaining sufficient reserve capacity to accommodate fluctuations, making them suitable for conventional units, demand response systems, electric vehicles, and energy storage applications. To better characterize wind power uncertainty, advanced statistical methods have been developed to enhance modeling accuracy. For instance, the Wasserstein distance metric enables the derivation of optimal discrete distributions for wind power generation<sup>[<xref ref-type="bibr" rid="B29">29</xref>]</sup>, while the Frank-Copula function facilitates the construction of joint probability distributions. These techniques, integrated with roulette wheel selection methods, generate comprehensive base scenario sets that can be further refined through improved spectral clustering algorithms incorporating noise filtering and distance correlation strategies. This systematic refinement process yields representative scenario sets that support two-level collaborative optimization models for integrated energy systems<sup>[<xref ref-type="bibr" rid="B30">30</xref>]</sup>. Recent methodological advancements include the quantile regression technique combined with dimensionality reduction clustering techniques, which leverage historical statistical data to transform deterministic prediction sequences into probabilistic output scenario sets<sup>[<xref ref-type="bibr" rid="B31">31</xref>]</sup>. Alternative approaches employ Latin hypercube sampling for scenario generation followed by fast antecedent elimination techniques to enhance scenario reduction efficiency<sup>[<xref ref-type="bibr" rid="B32">32</xref>]</sup>. In the present study, we implement a Principal Component Analysis (PCA)-enhanced K-means clustering approach for efficient load pattern identification<sup>[<xref ref-type="bibr" rid="B33">33</xref>]</sup>. Furthermore, we develop a high-fidelity representative day extraction method based on Dynamic Time Warping (DTW) and Density-Based Spatial Clustering of Applications with Noise (DBSCAN) for annual renewable generation data. These advanced techniques achieve significant data dimensionality reduction while preserving critical temporal characteristics, thereby providing a robust foundation for subsequent energy dispatch strategy implementation. The methodological framework maintains a rigorous representation of seasonal variations in wind and solar power generation patterns, ensuring the practical applicability of the resulting dispatch solutions.</p>
      <p>Advanced energy dispatch methodologies are instrumental in enhancing the operational performance and reliability of multi-source power supply architectures. Current energy dispatch approaches for such systems can be categorized into three main types: rule-based, optimization-based, and learning-based methods<sup>[<xref ref-type="bibr" rid="B34">34</xref>-<xref ref-type="bibr" rid="B36">36</xref>]</sup>. Rule-based strategies, while straightforward to implement, face significant limitations due to their dependence on predefined human knowledge. These approaches struggle with diverse input scenarios and require substantial time and resources to maintain as operational conditions evolve<sup>[<xref ref-type="bibr" rid="B34">34</xref>,<xref ref-type="bibr" rid="B37">37</xref>]</sup>. Optimization-based methods have demonstrated significant efficacy in resolving problems defined by complex sub-problem interdependencies and nested optimal substructures. Robust optimization algorithms have been successfully implemented across both wind and solar energy frameworks<sup>[<xref ref-type="bibr" rid="B38">38</xref>,<xref ref-type="bibr" rid="B39">39</xref>]</sup>, whereas particle swarm optimization has yielded promising results for hybrid renewable configurations, facilitating the development of coordinated control strategies for energy storage<sup>[<xref ref-type="bibr" rid="B40">40</xref>]</sup>. The Sequential Least Squares Programming (SLSQP) algorithm has been utilized to develop integrated dispatch frameworks for multi-source energy management, markedly enhancing resource utilization within wind-solar-hydrogen architectures. Although Convex Optimization based SLSQP implementations have been validated for augmenting dispatch efficacy and satisfying load requirements, they are often characterized by significant computational overhead, necessitating extensive iterations to achieve convergence<sup>[<xref ref-type="bibr" rid="B35">35</xref>]</sup>. Several challenges persist in current dispatch approaches. The inherent uncertainty of wind and solar generation remains inadequately addressed, and power adjustment mechanisms across supply-side components require further refinement. Additionally, while multi-energy, multi-device architectures require hourly resolution for modeling accuracy, the resulting multi-scale dispatch problems often exhibit prohibitive computational complexity, complicating the trade-off between real-time efficiency and long-term system stability. These computational demands stem from the need to simultaneously optimize multiple energy sources while accounting for their temporal variability and operational constraints. A comprehensive assessment of environmental costs in wind-solar-hydrogen systems requires multi-dimensional analysis, particularly examining the interplay between integrated energy-carbon pricing mechanisms and the system’s operational dynamics in relation to carbon emissions. Although current research underscores the pivotal role of energy storage systems in facilitating low-carbon transitions<sup>[<xref ref-type="bibr" rid="B41">41</xref>]</sup>, contemporary economic evaluations suggest that carbon trading frameworks primarily incentivize emission reductions through operational enhancements in energy efficiency, rather than catalyzing foundational industrial transformations. This insight carries important implications for the economic viability of large-scale power generation facilities pursuing de-carbonization strategies<sup>[<xref ref-type="bibr" rid="B42">42</xref>]</sup>. Furthermore, the dual approach of enhancing energy efficiency while optimizing industrial structures has been shown to augment the complementary emission mitigation capacity within emissions trading schemes, suggesting promising pathways for more effective climate mitigation in the power sector.</p>
      <p>Existing deep reinforcement learning (DRL) applications in energy dispatch frequently rely on centralized or single-agent architectures, which are susceptible to the curse of dimensionality and exhibit limited scalability when managing large-scale, multi-source systems<sup>[<xref ref-type="bibr" rid="B35">35</xref>,<xref ref-type="bibr" rid="B36">36</xref>]</sup>. Furthermore, prior economic formulations typically overlook complex market interactions such as carbon emission trading mechanisms. To address these critical gaps, this work proposes a comprehensive, system-level energy dispatch framework for a grid-connected wind-solar-hydrogen integrated energy system that incorporates thermal backup. As shown in <xref ref-type="fig" rid="fig1">Figure 1</xref>, the framework includes four components. First, a PCA-enhanced K-means clustering model identifies typical load patterns from annual data, reducing computation burdens while maintaining accuracy. Second, a DTW-DBSCAN method extracts representative days from renewable generation data, capturing seasonal and uncertain variations. Third, a DDPG algorithm optimizes dispatch strategies through continuous agent-environment interaction, allowing real-time adjustments and improving long-term performance. Fourth, HESS is integrated to provide long-duration and seasonal energy balancing. HESS enables bidirectional conversion between electricity and hydrogen. Although its efficiency is nonlinear and power-dependent, a constant round-trip efficiency is assumed for system-level modeling. Distributed learning improves training speed and supports the inclusion of carbon trading signals in dispatch decisions. The main contributions of this paper are summarized as follows: (1) Develops a joint data-driven pipeline integrating PCA-enhanced K-means clustering and DTW-DBSCAN to mitigate computational complexity while preserving high-fidelity representation of renewable generation uncertainties; (2) Proposes a highly scalable distributed training architecture that overcomes the dimensionality and computational limitations of conventional centralized DRL methods, significantly accelerating training speed and enabling efficient, stable scheduling in complex wind-solar-hydrogen systems; (3) Formulates a comprehensive economic model that systematically internalizes coal fuel consumption, carbon emission costs under trading mechanisms, and electricity procurement expenses into the multi-objective optimization process.</p>
      <fig id="fig1" position="float">
        <label>Figure 1</label>
        <caption>
          <p>The framework includes four components: (1) a PCA-enhanced K-means clustering model for identifying load patterns; (2) a DTW-DBSCAN method for extracting representative renewable generation days; (3) a DDPG algorithm for optimizing dispatch strategies; and (4) HESS integration for long-duration energy balancing. PCA: Principal Component Analysis; DTW: Dynamic Time Warping; DBSCAN: Density-Based Spatial Clustering of Applications with Noise; DDPG: Deep Deterministic Policy Gradient; HESS: hydrogen energy storage system.</p>
        </caption>
        <graphic xlink:href="aiagent2001.fig.1.jpg"/>
      </fig>
      <p>The remainder of this paper is organized as follows. Section EXPERIMENTAL presents the experimental methodology, including model development for the wind-solar-hydrogen systems and the design of the distributed reinforcement learning methods. Section RESULTS AND DISCUSSION provides the results and discussion across various scenarios. Finally, Section CONCLUSIONS concludes the paper.</p>
    </sec>
    <sec id="sec2">
      <title>EXPERIMENTAL</title>
      <sec id="sec2-1">
        <title>Efficient load pattern identification: A PCA-enhanced and K-means clustering model</title>
        <p>Clustering power load data is essential for identifying representative load patterns, which are fundamental to load forecasting, demand response programs, and efficient grid management. Traditional clustering techniques, such as Gaussian Mixture Models (GMM), have been extensively utilized due to their capacity to characterize complex data distributions via probabilistic assignments. However, GMM exhibits significant limitations: it is computationally burdensome for high-dimensional datasets and relies heavily on the assumption of underlying Gaussian distributions, which frequently fails to align with empirical load profiles. Furthermore, GMM requires meticulous initialization and shows high sensitivity to hyper-parameter configurations, including the number of components and covariance structures. To circumvent these challenges, we propose a framework integrating PCA with K-means clustering. PCA effectively reduces dimensionality by extracting dominant features, thereby enhancing computational throughput and mitigating the impact of noise. When applied to the reduced feature space, K-means clustering offers a rapid and interpretable methodology for partitioning load profiles into distinct, meaningful cohorts. Compared to GMM, the proposed PCA-K-means approach demonstrates enhanced scalability, reduced computational overhead, and greater ease of implementation, making it highly suitable for large-scale utility data analytics. This synergistic combination not only preserves the intrinsic structural properties of the data but also facilitates the robust identification of representative power consumption patterns.</p>
        <p>Let <inline-formula><tex-math id="M1">$$ X \in \mathbb{R}^{n \times 24} $$</tex-math></inline-formula> represent the matrix of normalized daily load profiles, where each row denotes a specific day and each column corresponds to an hourly interval. PCA is subsequently employed to project this data into a lower-dimensional feature space while retaining the maximum variance. The optimal number of principal components, <italic>m</italic>, is determined such that the cumulative explained variance exceeds a predefined threshold, <italic>τ</italic>. In this study, the threshold <italic>τ</italic> is established at 0.95 to ensure high information retention. Following dimensionality reduction, K-means clustering is applied to the transformed dataset <inline-formula><tex-math id="M2">$$ Z \in \mathbb{R}^{n \times m} $$</tex-math></inline-formula>. The objective is to partition the dataset into <italic>k</italic> distinct clusters, <italic>C<sub>i</sub></italic>, by minimizing the total intra-cluster variance.</p>
        <p><disp-formula> <label></label> <tex-math id="E1"> $$ \underset{C_1,\ldots,C_k}{\text{argmin}} \sum_{i=1}^{k} \sum_{z_j \in C_i} \| z_j - \mu_i \|^2 $$ </tex-math></disp-formula></p>
        <p>where <italic>z<sub>j</sub></italic> denotes the <italic>j</italic>-th load profile in the reduced feature space, and <italic>μ<sub>i</sub></italic> represents the centroid of cluster <italic>C<sub>i</sub></italic>. For each cluster, a representative day is identified as the original profile whose lower-dimensional representation is closest to the respective cluster centroid.</p>
        <p><disp-formula> <label></label> <tex-math id="E2"> $$ x_i^* = \underset{x_j \in C_i}{\text{argmin}} \| z_j - \mu_i \| $$ </tex-math></disp-formula></p>
        <p>where <italic>x<sub>j</sub></italic> signifies the original 24-hour load curve corresponding to the reduced data point <italic>z<sub>j</sub></italic>. The resulting set <inline-formula><tex-math id="M3">$$ \{x_1^*,\ldots,x_k^*\} $$</tex-math></inline-formula> effectively captures the characteristic variability of annual demand with reduced computational complexity. This extracted set provides a robust foundation for subsequent multi-objective dispatch optimization.</p>
        <p><xref ref-type="fig" rid="fig2">Figure 2A</xref> illustrates the cumulative explained variance as a function of the dimensionality of the retained principal subspace. The analysis reveals that a specific quantity of principal components is essential to sustain a cumulative variance threshold of at least 95%. This provides a rigorous theoretical justification for dimensionality reduction, ensuring that the intrinsic characteristics of the original data are preserved while simplifying the data structure. As demonstrated by the results, the cumulative explained variance exceeds 95% when the principal component count is established at four. This configuration ensures the retention of the majority of informational content without incurring significant computational overhead from excessive dimensionality. <xref ref-type="fig" rid="fig2">Figure 2B</xref> presents the load profiles of seven representative days extracted via the K-means clustering algorithm. Each individual curve characterizes the typical load pattern of a specific cluster, reflecting temporal variations over a 24-hour horizon. A comparative analysis across these clusters reveals distinct differences, particularly in the timing of peak loads and the magnitude of fluctuations. These representative profiles offer crucial insights for load forecasting and power system scheduling, facilitating the identification of diverse consumption patterns and their operational implications for the grid.</p>
        <fig id="fig2" position="float">
          <label>Figure 2</label>
          <caption>
            <p>Cluster results. (A) Cumulative explained variance; (B) Representative days load.</p>
          </caption>
          <graphic xlink:href="aiagent2001.fig.2.jpg"/>
        </fig>
        <p><xref ref-type="fig" rid="fig3">Figure 3A</xref> illustrates the distribution of the dataset within the reduced PCA space, projected onto the first two principal components. In this visualization, distinct colors signify the clustering results obtained via the K-means algorithm. The clear segregation of color-coded points demonstrates the effective identification of a robust clustering structure within the reduced-dimensional feature space. Furthermore, <xref ref-type="fig" rid="fig3">Figure 3B</xref> exemplifies the percentage of explained variance for the first four principal components in a bar chart format. Quantitative values for explained variance are explicitly annotated above each bar, facilitating a direct comparison of the relative contribution of each individual component. By evaluating the explained variance, the significance of each principal component in preserving the original data variability can be rigorously assessed. This analysis consequently determines the optimal dimensionality to retain during the feature extraction process.</p>
        <fig id="fig3" position="float">
          <label>Figure 3</label>
          <caption>
            <p>Cluster analysis. (A) PCA + K-means clustering (first two principal components); (B) Explanatory variance for the first four dimensions; (C) Load profiles for all days in each cluster; (D) Number of days in each cluster. PCA<italic>: </italic>Principal Component Analysis.</p>
          </caption>
          <graphic xlink:href="aiagent2001.fig.3.jpg"/>
        </fig>
        <p><xref ref-type="fig" rid="fig3">Figure 3C</xref> depicts the load profiles of all days within each cluster, with the representative day’s load profile highlighted in red. Transparent curves in different colors represent the load variations of individual days within the same cluster, while the red curve signifies the typical load pattern of the cluster. By comparing the load profiles within a cluster, the consistency of load patterns can be observed, and the red curve emphasizes the core characteristics of the cluster, and <xref ref-type="fig" rid="fig3">Figure 3D</xref> shows the size of the data in each cluster, and makes a special cluster for the anomalous data.</p>
      </sec>
      <sec id="sec2-2">
        <title>Renewable generation pattern identification: a high-fidelity DTW-DBSCAN model</title>
        <p>Traditional methods for extracting representative days from wind and solar power generation time series often utilize Euclidean distance-based clustering techniques, such as K-means or arithmetic averaging. Although computationally efficient, these methods possess notable limitations, specifically their reliance on linear temporal alignment and their inability to capture phase shifts, peak duration variations, or other nonlinear temporal distortions inherent in renewable energy profiles. In contrast, the DTW-DBSCAN framework excels at capturing local temporal distortions and can autonomously filter noise through density thresholds, thereby eliminating the need for subjective pre-specification of cluster counts. This study introduces a hybrid methodology that integrates DTW and DBSCAN. The proposed approach segments annual generation data into 24-hour daily subsequences on a seasonal basis, constructs a DTW-based shape similarity distance matrix to quantify morphological alignment, and performs unsupervised clustering via DBSCAN to categorize days with similar fluctuation patterns. Unlike conventional centroid-based methods that rely on Euclidean assumptions, representative days in this study are selected as real-world samples located closest to the DTW-geometric centroids within each cluster. Given a set of daily generation curves {<italic>x<sub>1</sub>,x<sub>2</sub>,…,x<sub>n</sub></italic>}, where each <inline-formula><tex-math id="M4">$$ x_i \in \mathbb{R}^T $$</tex-math></inline-formula> denotes the generation series for a day with <italic>T</italic> time intervals, the DTW distance between two sequences <italic>x<sub>i</sub></italic> and <italic>x<sub>j</sub></italic> is defined as follows:</p>
        <p><disp-formula> <label></label> <tex-math id="E3"> $$ DTW(x_i, x_j) = \min_{w \in W} \sum_{(p,q) \in w} \| x_i^p - x_j^q \| $$ </tex-math></disp-formula></p>
        <p>In this context, <italic>W</italic> denotes the set of all valid warping paths, while (<italic>p</italic>, <italic>q</italic>) represents the aligned index pair between sequences. A pairwise DTW distance matrix <inline-formula><tex-math id="M5">$$ D \in \mathbb{R}^{n \times n} $$</tex-math></inline-formula> is then constructed and utilized as input for the DBSCAN clustering algorithm. DBSCAN effectively categorizes days with similar temporal patterns without the necessity of specifying a predefined number of clusters. The algorithm identifies clusters based on a neighborhood radius <italic>ε</italic> and a minimum number of points minPts required to establish a dense region. For each resulting cluster <italic>C<sub>k</sub></italic>, the most representative day is identified as</p>
        <p><disp-formula> <label></label> <tex-math id="E4"> $$ x_k^* = \text{arg}\underset{x_i \in C_k}{\text{min}} \sum_{x_j \in C_k} DTW(x_i, x_j) $$ </tex-math></disp-formula></p>
        <p>The optimal selection process yields representative curves <inline-formula><tex-math id="M6">$$ \{x_1^*,\ldots,x_k^*\} $$</tex-math></inline-formula> by minimizing intra-cluster DTW distances, thereby effectively preserving the characteristic temporal patterns and inherent variability of wind and solar generation profiles. This methodology provides high-fidelity inputs for the planning and operational simulation of renewable-dominated energy systems, ensuring robustness against the temporal uncertainty and nonlinear dynamics intrinsic to wind and solar resources. The seasonal profiles for solar and wind power, along with their corresponding representative days, are illustrated in <xref ref-type="fig" rid="fig4">Figures 4</xref> and <xref ref-type="fig" rid="fig5">5</xref>, respectively. These profiles accurately capture the characteristic seasonal energy levels.</p>
        <fig id="fig4" position="float">
          <label>Figure 4</label>
          <caption>
            <p>Solar power generation profiles during the year. (A) Spring; (B) Summer; (C) Autumn; (D) Winter.</p>
          </caption>
          <graphic xlink:href="aiagent2001.fig.4.jpg"/>
        </fig>
        <fig id="fig5" position="float">
          <label>Figure 5</label>
          <caption>
            <p>Wind power generation profiles during the year. (A) Spring; (B) Summer; (C) Autumn; (D) Winter.</p>
          </caption>
          <graphic xlink:href="aiagent2001.fig.5.jpg"/>
        </fig>
        <p>The representative power generation profiles for solar and wind resources under seasonal variations are depicted in <xref ref-type="fig" rid="fig6">Figure 6A</xref> and <xref ref-type="fig" rid="fig6">B</xref>, respectively. Observations indicate that solar generation typically peaks during daylight hours in spring and summer, whereas wind power yields higher output during the day in spring and autumn and transitions to nighttime peaks during winter. These results demonstrate that renewable energy generation is not stochastic but adheres to predictable diurnal and seasonal cycles. The presence of these inherent temporal patterns provides a robust foundation for employing clustering methods to effectively identify and characterize typical operational scenarios.</p>
        <fig id="fig6" position="float">
          <label>Figure 6</label>
          <caption>
            <p>Diurnal power generation profiles across all seasons. (A) Solar power generation profiles; (B) Wind power generation profiles.</p>
          </caption>
          <graphic xlink:href="aiagent2001.fig.6.jpg"/>
        </fig>
      </sec>
      <sec id="sec2-3">
        <title>Hydrogen energy storage system model</title>
        <p>The HESS configuration investigated in this research comprises an electrolyzer, a hydrogen storage vessel, and a proton exchange membrane fuel cell (PEMFC). This system serves exclusively as an energy conversion and storage medium, where the electrolyzer converts surplus renewable generation into stored hydrogen. Conversely, the fuel cell regenerates electrical power by consuming the stored hydrogen during periods of supply deficit. In industrial practice, the energy conversion efficiency of hydrogen storage is influenced by various nonlinear parameters, including operating power, thermal conditions, and current density. While high-fidelity empirical models can precisely characterize these behaviors, they introduce substantial computational complexity and non-convexity to the optimization problem. To maintain mathematical tractability and computational efficiency, this study utilizes a linearized constant efficiency approximation. The operational efficiencies for the charging and discharging cycles are assumed to be fixed coefficients, represented by <italic>η<sub>H,c</sub></italic>  and <italic>η<sub>H,d</sub></italic>, respectively. For illustrative purposes, both efficiencies are assigned a value of 0.5<sup>[<xref ref-type="bibr" rid="B43">43</xref>]</sup>, resulting in an aggregate round-trip efficiency of 25%. To model the hydrogen storage model to fit the system, we specify the evolution of the State of Charge (SoC) model of the HESS as</p>
        <p><disp-formula> <label></label> <tex-math id="E5"> $$ E_{H,t+1} = E_{H,t} + \Delta t \cdot \left( \eta_{H,c} \cdot P_{H,c,t} - \frac{1}{\eta_{H,d}} \cdot P_{H,d,t} \right) $$ </tex-math></disp-formula></p>
        <p>where E<italic><sub>H,t</sub></italic> is the hydrogen storage state of charge at time <italic>t</italic>, <italic>P<sub>H,c,t</sub></italic> and <italic>P<sub>H,d,t</sub></italic> are the charging and discharging power, representing the power input to the electrolyzer and the power output from the fuel cell, respectively, <italic>η<sub>H,c</sub></italic> and <italic>η<sub>H,d</sub></italic> are the constant charging and discharging efficiencies, Δ<italic>t</italic> is the duration of the time interval. The discharging term is expressed as <italic>P<sub>H,d,t</sub></italic>/<italic>η<sub>H,d</sub></italic> because the electrical energy output from the fuel cell is only a fraction of the actual energy extracted from the hydrogen storage. Specifically, to deliver <italic>P<sub>H,d,t</sub></italic> units of electricity to the system, the fuel cell must consume <italic>P<sub>H,d,t</sub></italic>/<italic>η<sub>H,d</sub></italic> units of hydrogen energy.</p>
        <p>This simplified model provides a practical balance between modeling fidelity and computational efficiency, and is particularly suitable for long-term scheduling, where large-scale dispatch problems must be solved repeatedly with limited computational resources.</p>
      </sec>
      <sec id="sec2-4">
        <title>Efficient optimal scheduling method for wind-solar-hydrogen systems</title>
        <p>Optimizing the dispatch strategy for hybrid systems to achieve socio-economic benefits calls for a comprehensive consideration of grid requirements, maximization of renewable energy penetration, and minimization of carbon emission costs. With the advancement of distributed reinforcement learning (DRL), distributed frameworks have demonstrated significant efficacy in large-scale optimization tasks. As illustrated in <xref ref-type="fig" rid="fig7">Figure 7</xref>, the fundamental architecture of these algorithms typically comprises three core components: the Actor, the Learner, and the Coordinator. The Actor interacts with the environment to generate experience data, the Learner processes this data to update policy network parameters, and the Coordinator facilitates seamless communication between these components to ensure efficient data transfer. Parallel execution across multiple Actors significantly enhances data throughput, thereby providing a robust data foundation for training complex decision-making policies. In light of the temporal uncertainties inherent in wind and solar generation, this study proposes a multi-faceted power management policy tailored for wind-solar-hydrogen systems. The approach leverages deep reinforcement learning to govern an autonomous agent that maximizes cumulative rewards through continuous interaction with its environment. This decision-making process is guided by optimized policies that determine the most effective actions based on the current system states.</p>
        <fig id="fig7" position="float">
          <label>Figure 7</label>
          <caption>
            <p>Basic framework of distributed reinforcement learning.</p>
          </caption>
          <graphic xlink:href="aiagent2001.fig.7.jpg"/>
        </fig>
        <p>In DRL, two common paradigms exist: synchronous and asynchronous training. In synchronous algorithms, all actor processes must wait for global policy updates before continuing their interactions with the environment. This strict synchronization often leads to idle time, as actors or learners must pause until others complete their tasks, resulting in reduced system efficiency. In contrast, asynchronous algorithms allow actors to interact with their environments and update local models independently, without waiting for global synchronization. This approach significantly improves data throughput and computational efficiency, making it especially suitable for large-scale and complex tasks such as energy dispatch optimization in wind-solar-hydrogen systems. The distributed extension of the DDPG algorithm adopts this parallelism by enabling multiple actor processes to interact with independent environments simultaneously, each maintaining a local policy <inline-formula><tex-math id="M7">$$ \mu_i(s|\theta_i^\mu) $$</tex-math></inline-formula> and value function <inline-formula><tex-math id="M8">$$ Q_i(s,a|\theta_i^Q) $$</tex-math></inline-formula>. To ensure consistency, these local parameters are periodically aggregated to update global networks. The aggregation rule averages the parameters across all actors:</p>
        <p><disp-formula> <label></label> <tex-math id="E6"> $$ \theta^\mu \leftarrow \frac{1}{N} \sum_{i=1}^N \theta_i^\mu, \quad \theta^Q \leftarrow \frac{1}{N} \sum_{i=1}^N \theta_i^Q $$ </tex-math></disp-formula></p>
        <p>where <italic>N</italic> denotes the number of distributed actors. Each actor samples experiences (<italic>s<sub>i</sub></italic>,<italic>a<sub>i</sub></italic>,<italic>r<sub>i</sub></italic>,<italic>s'<sub>i</sub></italic>) and contributes to a shared replay buffer <inline-formula><tex-math id="M9">$$ D = \bigcup_{i=1}^N D_i $$</tex-math></inline-formula>, which the learner uses to train global networks. The distributed framework significantly improves data throughput and training stability. In asynchronous implementations, actors continue to collect data without waiting for global parameter updates, further enhancing scalability for large-scale energy dispatch optimization. In the distributed implementation, a centralized-learner and multi-actor architecture is facilitated by the MATLAB Parallel Server. Specifically, three parallel actor nodes are deployed, each assigned to a distinct segment of the annual load-pattern dataset and maintaining a local copy of the target policy network to interact with its respective environmental instance. To balance computational efficiency with training stability, the synchronization and aggregation interval is established at <italic>N<sub>sync</sub></italic>=10. Consequently, the system enforces a global synchronization every 10 parameter iterations by broadcasting the latest Actor network weights to all parallel nodes. This mechanism ensures that each actor receives timely feedback from the global policy, effectively mitigating the gradient lag phenomenon inherent in DRL. Distributed storage and asynchronous update mechanisms further optimize training efficiency. The update rule for the online Q-network is formulated as</p>
        <p><disp-formula> <label></label> <tex-math id="E7"> $$ \theta^Q \leftarrow \theta^Q + \alpha_Q \delta \cdot \nabla_{\theta^{Q}} Q(s,a|\theta^Q) $$ </tex-math></disp-formula></p>
        <p>Here, <italic>θ<sup>Q</sup></italic> denotes the parameters of the online Q-network, while <italic>α<sub>Q</sub></italic> represents the learning rate. The temporal difference error is symbolized by <italic>δ</italic>. The term <inline-formula><tex-math id="M10">$$ \nabla_{\theta^{Q}} Q(s,a|\theta^Q) $$</tex-math></inline-formula> refers to the gradient of the Q-value function with respect to the network parameters, characterizing how variations in <italic>θ<sup>Q</sup></italic> impact the Q-value. This gradient directs the parameter updates to minimize the temporal difference error <italic>δ</italic>. The online policy network is updated as</p>
        <p><disp-formula> <label></label> <tex-math id="E8"> $$ \theta^\mu \leftarrow \theta^\mu + \alpha_\mu \nabla_{\theta^\mu} \mu(s|\theta^\mu) \nabla_a Q(s,a|\theta^Q) \bigg|_{a=\mu(s|\theta^\mu)} $$ </tex-math></disp-formula></p>
        <p>In this context, <italic>θ<sup>μ</sup></italic> represents the parameters of the online policy network, whereas <italic>α<sub>μ</sub></italic> denotes the associated learning rate. The expression <inline-formula><tex-math id="M11">$$ \nabla_{\theta^\mu} \mu(s|\theta^\mu) \nabla_a Q(s,a|\theta^Q) \bigg|_{a=\mu(s|\theta^\mu)} $$</tex-math></inline-formula> signifies the gradient of the expected Q-value with respect to the policy network parameters, computed via the deterministic policy gradient theorem. This gradient provides the direction for updating <italic>θ<sup>μ</sup></italic> to maximize the expected return. The target networks are updated as</p>
        <p><disp-formula> <label></label> <tex-math id="E9"> $$ \left\{\begin{array}{l}
\theta^{Q^{\prime}} \leftarrow \tau \theta^{Q}+(1-\tau) \theta^{Q^{\prime}} \\
\theta^{u^{\prime}} \leftarrow \tau \theta^{\mu}+(1-\tau) \theta^{u^{\prime}}
\end{array}\right.
 $$ </tex-math></disp-formula></p>
        <p>where <italic>τ</italic> denotes the soft update rate, typically a small constant significantly less than 1, which governs the update frequency of the target networks. The symbols <italic>θ<sup>Q'</sup></italic> and <italic>θ<sup>μ'</sup></italic> designate the parameters of the target Q-network and the target policy network, respectively. By adjusting the target networks incrementally, this soft update mechanism enhances training stability and facilitates convergence.</p>
        <p>The agent-environment interaction in the DDPG algorithm’s reinforcement learning framework is formally modeled as a Markov Decision Process (MDP), represented by the five-tuple (<italic>s</italic>, <italic>a</italic>, <italic>r</italic>, <italic>p</italic>, <italic>γ</italic>). The state <italic>s</italic> encapsulates comprehensive environmental information that serves as input to the neural network, while the action <italic>a</italic> constitutes the network’s output that directly influences and modifies the environment. The reward signal <italic>r</italic> provides the optimization direction for the neural network, driving parameter updates toward higher expected returns. The state transition probability <italic>p</italic>, representing the environment’s intrinsic physical dynamics, determines the likelihood of subsequent state transitions. The discount factor <italic>γ</italic> governs the temporal value perspective, ensuring that the agent balances immediate operational costs with the long-term flexibility of energy storage over the scheduling cycle. Each of these fundamental MDP components will be rigorously defined and mathematically characterized in the subsequent subsections.</p>
      </sec>
      <sec id="sec2-5">
        <title>State and action space definitions</title>
        <p>The DDPG algorithm is implemented to derive an optimal control policy within real-valued state and action domains. The reinforcement learning architecture primarily comprises two interacting entities: the environment and the agent. At each discrete time step <italic>t</italic>, the agent perceives the current system state <italic>s</italic>(<italic>t</italic>) and subsequently determines an action <italic>a</italic>(<italic>t</italic>)=[<italic>P<sub>t</sub></italic>(<italic>t</italic>)] within the bounded action space [-Δ<italic>P<sub>t</sub></italic>,Δ<italic>P<sub>t</sub></italic>] (MWh). To fulfill the prescribed control objectives, the state vector s(<italic>t</italic>) must encapsulate comprehensive environmental information necessary for informing effective decision-making. The constituent variables of the state space are defined as follows:</p>
        <p><disp-formula> <label></label> <tex-math id="E10"> $$ s(t) = \left[ P_t(t),\ P_w(t),\ P_s(t),\ P_e(t),\ P_{bsc}(t),\ P_{dem}(t) \right] $$ </tex-math></disp-formula></p>
        <p>Within this framework, <italic>P<sub>t</sub></italic>(<italic>t</italic>) signifies the active power output of the thermal unit at time <italic>t</italic>, which fluctuates in response to operational requirements and demand variations. The variables <italic>P<sub>w</sub></italic>(<italic>t</italic>) and <italic>P<sub>s</sub></italic>(<italic>t</italic>) characterize the power generation from wind and solar energy sources at time <italic>t</italic>, respectively. Furthermore, <italic>P<sub>e</sub></italic>(<italic>t</italic>) denotes the quantity of external power acquired at time <italic>t</italic> to compensate for the gaps in the thermal unit’s generation. The term <italic>P<sub>bsc</sub></italic>(<italic>t</italic>) identifies the power contribution from the HESS, while <italic>P<sub>dem</sub></italic>(<italic>t</italic>) represents the instantaneous load demand.</p>
        <p>To explicitly clarify the system dynamics, the variables within this framework are categorized into exogenous and controllable variables. The exogenous (uncontrollable) variables include the wind power output <italic>P<sub>w</sub></italic>(<italic>t</italic>), solar power output <italic>P<sub>s</sub></italic>(<italic>t</italic>), and regional electrical load demand <italic>P<sub>dem</sub></italic>(<italic>t</italic>), which are fundamentally driven by meteorological and stochastic operational processes. Conversely, the controllable variables encompass the active power output of the thermal unit <italic>P<sub>t</sub></italic>(<italic>t</italic>), the electrical power exchanged with the external grid P<italic><sub>e</sub></italic>(<italic>t</italic>), and the power contribution from the hydrogen energy storage system <italic>P<sub>bsc</sub></italic>(<italic>t</italic>). Specifically, the agent directly dictates the dispatch of the thermal unit, while the grid exchange and HESS operations dynamically ensure system-wide load balance. To ensure a high-fidelity simulation of thermal power dynamics, the rate of change in <italic>P<sub>t</sub></italic>(<italic>t</italic>) is strictly governed by the thermal unit’s maximum allowable ramp rate. In instances where the thermal power output exceeds these operational boundaries, the current training episode is terminated.</p>
      </sec>
      <sec id="sec2-6">
        <title>Reward function design</title>
        <p>The optimization of the energy dispatch strategy for thermal generation is primarily aimed at securing economic advantages for both societal and industrial stakeholders. Consequently, it is imperative to satisfy system load requirements, maximize the penetration of renewable energy sources, and minimize total operational expenditures. This research utilizes thermal power units and energy storage systems as synergistic components to facilitate the seamless integration of wind and solar power. Furthermore, a carbon trading mechanism is integrated into the dispatch framework, simultaneously addressing coal consumption costs and CO<sub>2</sub> emission levels. Such an integrative approach seeks to establish a harmonized equilibrium between economic performance and environmental sustainability through carbon reduction.</p>
        <p>(1) <bold>Coal fuel cost:</bold> To minimize the expenditures associated with coal fuel consumption, the cost function <italic>r</italic><sub>1</sub> is formulated as follows:</p>
        <p><disp-formula> <label></label> <tex-math id="E11"> $$ r_1(t) = \text{Y}_c \cdot G(P_t(t)) $$ </tex-math></disp-formula></p>
        <p><disp-formula> <label></label> <tex-math id="E12"> $$ G(P_t(t)) = a_1 \left( P_t(t) \right)^2 + b_1 (P_t(t)) + c_1 $$ </tex-math></disp-formula></p>
        <p>where each time step <italic>t</italic> corresponds to a specific interval for decision-making. Y<sub>c</sub> is the total coal combustion cost, set at 115 $/MWh, and serves as the basis for evaluating the economic impact of fuel consumption. The coal consumption function <italic>G</italic>(<italic>P<sub>t</sub></italic>(<italic>t</italic>)) estimates the coal usage of the thermal unit based on its power output <italic>P<sub>t</sub></italic>(<italic>t</italic>), the parameters <italic>a</italic><sub>1</sub>, <italic>b</italic><sub>1</sub>, and <italic>c</italic><sub>1</sub> are used in computing the thermal unit’s output and corresponding coal consumption. Since non-fossil energy sources do not directly emit CO<sub>2</sub>, the model includes carbon costs only for thermal power generation.</p>
        <p>(2) <bold>Carbon emission expenditure:</bold> Since non-fossil energy sources produce zero direct emissions, the model allocates carbon-related expenditures solely to the thermal power component. The total volume of CO<sub>2</sub> emissions is quantified as follows:</p>
        <p><disp-formula> <label></label> <tex-math id="E13"> $$ M(P_t(t)) = a_2 \left( P_t(t) \right)^2 + b_2 (P_t(t)) + c_2 $$ </tex-math></disp-formula></p>
        <p>Here, <italic>M</italic>(<italic>P<sub>t</sub></italic>(<italic>t</italic>)) signifies the CO<sub>2</sub> emissions associated with the thermal unit output <italic>P<sub>t</sub></italic>(<italic>t</italic>) at time <italic>t</italic>. The emission volume is determined via parameters <italic>a</italic><sub>2</sub>, <italic>b</italic><sub>2</sub>, and <italic>c</italic><sub>2</sub>, which mathematically characterize the emission characteristics of the specific unit. Accounting for carbon market mechanisms, this study assumes that a predefined quota of CO<sub>2</sub> emission allowances is allocated gratis. These free-of-charge allowances are determined by the generation capacity and the specific category of the thermal unit, formulated as follows:</p>
        <p><disp-formula> <label></label> <tex-math id="E14"> $$ M_{fr}(t) = \beta P_t(t) \Delta t $$ </tex-math></disp-formula></p>
        <p>where <italic>M<sub>fr</sub></italic>(<italic>t</italic>) represents the free CO<sub>2</sub> emission allowance allocated to the thermal unit at time <italic>t</italic>. The coefficient <italic>β</italic> defines the amount of allowance granted per unit of electricity generated. Under this allocation scheme, any emissions exceeding the free allowance must be offset through the purchase of additional allowances, resulting in a CO<sub>2</sub> emission cost. To penalize excessive emissions and encourage low-carbon generation, the CO<sub>2</sub> emission cost function <italic>r</italic><sub>2</sub>(<italic>t</italic>) is defined as</p>
        <p><disp-formula> <label></label> <tex-math id="E15"> $$ M_{sp}(t) = M(P_t(t)) - M_{fr}(t) $$ </tex-math></disp-formula></p>
        <p><disp-formula> <label></label> <tex-math id="E16"> $$ r_2(t) = \begin{cases}
\varphi \cdot M_{sp}(t), &amp; \text{if } M_{sp}(t) > 0 \\
0, &amp; \text{otherwise}
\end{cases}
 $$ </tex-math></disp-formula></p>
        <p>where <italic>M<sub>sp</sub></italic>(<italic>t</italic>) denotes the payable CO<sub>2</sub> emissions at time <italic>t</italic>, representing the amount by which actual emissions exceed the allocated free allowance. The carbon price is denoted by <italic>φ</italic>, which is set to 11.5 $/ton<sup>[<xref ref-type="bibr" rid="B44">44</xref>]</sup>. This price is used to calculate the monetary cost of excess emissions within the carbon trading framework.</p>
        <p>(3) <bold>Electricity purchase cost:</bold> To ensure that the power supply meets the load demand, an electricity purchase cost is introduced based on the power deficit. The total power is given by</p>
        <p><disp-formula> <label></label> <tex-math id="E17"> $$ P(t) = P_t(t) + P_w(t) + P_s(t). $$ </tex-math></disp-formula></p>
        <p>If an energy storage system is considered, <italic>P</italic>(<italic>t</italic>) also includes the power supplied by storage, <italic>P<sub>bsc</sub></italic>(<italic>t</italic>). The electricity purchase cost <italic>r</italic><sub>3</sub>(<italic>t</italic>) is defined as</p>
        <p><disp-formula> <label></label> <tex-math id="E18"> $$ r_3(t) = \begin{cases}
Y_e \cdot \left( P_{dem}(t) - P(t) \right), &amp; \text{if } P_{dem}(t) > P(t) \\
0, &amp; \text{otherwise}
\end{cases}
 $$ </tex-math></disp-formula></p>
        <p>Here, Y<italic><sub>e</sub></italic> denotes the electricity purchase price, established at 327 $/MWh, which serves as a benchmark for comparing external grid procurement costs with internal generation expenses. The term Δ<italic>t</italic> represents the dispatch interval, fixed at 1 h, which determines the temporal resolution of the simulation model. Throughout the interaction process, the agent acquires rewards by progressively modifying the environmental state. The instantaneous reward <italic>r</italic>(<italic>t</italic>) is formulated as follows:</p>
        <p><disp-formula> <label></label> <tex-math id="E19"> $$ r(t) = -(r_1(t) + r_2(t) + r_3(t)). $$ </tex-math></disp-formula></p>
        <p>The control policy is continuously updated by the agent based on the value of the reward function. Given that the energy management of the wind-solar-hydrogen system involves time-coupled variables, such as the SoC of the HESS, the scheduling task is formulated as a finite-horizon MDP. Accordingly, the discount factor <italic>γ</italic> is set to 1. This configuration ensures that the agent yields an unbiased cumulative reward over the entire daily scheduling cycle, effectively balancing immediate operational costs with the long-term flexibility of energy storage.</p>
      </sec>
      <sec id="sec2-7">
        <title>Binding conditions</title>
        <p>To guarantee the stable and reliable operation of the integrated wind-solar-hydrogen system, several physical hard constraints must be strictly adhered to. The mathematical formulations of these constraints are presented as follows:</p>
        <p>(1) <bold>Load balance constraint:</bold> To ensure load balance at each time <italic>t</italic>, the total power supply must be no less than the load demand. The supply includes wind power <italic>P<sub>w</sub></italic>(<italic>t</italic>), solar power <italic>P<sub>s</sub></italic>(<italic>t</italic>), purchased electricity <italic>P<sub>e</sub></italic>(<italic>t</italic>), storage discharge power <italic>P<sub>bdc</sub></italic><sub>h</sub>(<italic>t</italic>), and subtracts storage charging power <italic>P<sub>bc</sub></italic><sub>h</sub>(<italic>t</italic>). The load balance constraint is given by</p>
        <p><disp-formula> <label></label> <tex-math id="E20"> $$ P_t(t) + P_w(t) + P_s(t) + P_e(t) + P_{bdch}(t) - P_{bch}(t) \geq P_{dem}(t). $$ </tex-math></disp-formula></p>
        <p>(2) <bold>Climbing capacity constraint: </bold>To accurately replicate the operational behavior of an actual thermal power unit, a ramp rate limitation is imposed on the power output fluctuations. The governing climbing capacity constraint is formulated as follows:</p>
        <p><disp-formula> <label></label> <tex-math id="E21"> $$ -\Delta P_t \leq (P_t(t+\Delta t) - P_t(t)) / \Delta t \leq \Delta P_t $$ </tex-math></disp-formula></p>
        <p>Here, Δ<italic>P<sub>t</sub></italic> signifies the ramp rate limitation of the thermal power generating unit. This parameter establishes the maximum permissible increment or decrement in power output within a discrete time interval.</p>
        <p>(3) <bold>Generation output constraints:</bold> The active power outputs produced by the thermal, solar, and wind units are strictly governed by their individual rated capacities. These operational boundaries are mathematically defined as follows:</p>
        <p><disp-formula> <label></label> <tex-math id="E22"> $$ \begin{cases}
0 \leq P_t(t) \leq P_{t,\max} \\
0 \leq P_w(t) \leq P_{w,\max} \\
0 \leq P_s(t) \leq P_{s,\max}
\end{cases}
 $$ </tex-math></disp-formula></p>
        <p>In this formulation, <italic>P<sub>t</sub></italic><sub>,max</sub> signifies the maximum generation capacity assigned to the thermal power unit. Analogously, <italic>P<sub>w</sub></italic><sub>,max</sub> and <italic>P<sub>s</sub></italic><sub>,max</sub> designate the peak output limitations for the wind and solar generating units, respectively.</p>
        <p>(4) <bold>Storage capacity constraints:</bold> The operational capacity of the energy storage system is governed by specific physical thresholds to ensure safety and longevity. These capacity limitations are mathematically formulated as follows:</p>
        <p><disp-formula> <label></label> <tex-math id="E23"> $$ \begin{cases}
P_{bsc}(t-1) - P_{bch}(t) + P_{bch}(t) = P_{bsc}(t) \\
0 \leq P_{bsc}(t) \leq P_{bsc,\max}
\end{cases}
 $$ </tex-math></disp-formula></p>
        <p>In this formulation, <italic>P<sub>bsc</sub></italic>(<italic>t</italic>) denotes the power supplied by the HESS at time <italic>t</italic>. As defined above, <italic>P<sub>bc</sub></italic><sub>h</sub>(<italic>t</italic>) and <italic>P<sub>bdc</sub></italic><sub>h</sub>(<italic>t</italic>) represent the charging and discharging power of the HESS at time <italic>t</italic>, respectively.</p>
        <p>(5) <bold>Charging and discharging constraints:</bold> The operational logic presiding over the charging and discharging cycles of the energy storage system is articulated as follows:</p>
        <p><disp-formula> <label></label> <tex-math id="E24"> $$ \begin{cases}
P_{bch}(t)\Delta t + P_{bsc}(t-1) \leq P_{bsc,\max} \\
P_{bdch}(t)\Delta t - P_{bsc}(t-1) \leq 0 \\
0 \leq P_{bch}(t) \leq P_{bch,\max} \\
0 \leq P_{bdch}(t) \leq P_{bdch,\max}
\end{cases}
 $$ </tex-math></disp-formula></p>
        <p>where <italic>P<sub>bdc</sub></italic><sub>h,max</sub> and <italic>P<sub>bc</sub></italic><sub>h,max</sub> denote the maximum discharging and charging power of the energy storage system, respectively, <italic>P<sub>bsc</sub></italic><sub>,max</sub> represents the maximum energy storage capacity of the system.</p>
        <p>The neural network takes power system data as input and outputs adjustment values for thermal power generation. Through this continuous input-output interaction, the operation of the thermal unit is progressively optimized, improving overall generation efficiency within each sampling interval.</p>
      </sec>
    </sec>
    <sec id="sec3">
      <title>RESULTS AND DISCUSSION</title>
      <p>This study utilizes annual time-series data for electricity load, wind power, and solar generation, sourced from the Open Power Systems Data platform<sup>[<xref ref-type="bibr" rid="B45">45</xref>]</sup>. The dataset provides hourly-resolution information essential for power system modeling, encompassing electricity prices, load profiles, and renewable generation capacities aggregated across various European bidding zones and neighboring regions. For the simulation, seven representative days are extracted from the annual records to capture seasonal variations: spring (Days 2 and 3), summer (Days 4 and 5), autumn (Day 6), and winter (Days 1 and 7).</p>
      <p>To facilitate a comprehensive comparative analysis, ten distinct experimental scenarios are defined within this research, with their respective scheduling profiles illustrated in <xref ref-type="fig" rid="fig8">Figure 8</xref>. Scenario 1 utilizes the Sequential Least Squares Programming (SLSQP) algorithm in the absence of an energy storage system. Scenario 2 implements the SLSQP algorithm integrated with a hydrogen-based energy storage system. Scenario 3 adopts the Deep Deterministic Policy Gradient (DDPG) optimization framework without storage capabilities. Scenario 4 applies the DDPG algorithm coupled with a hydrogen-based storage system. Scenario 5 employs a DRL approach without energy storage. Scenario 6 integrates the DRL algorithm with a hydrogen-based storage system. Scenario 7 extends the configuration of Scenario 6 by incorporating wind and solar generation data processed through an First-order Autoregressive [AR(1)] noise model. Scenarios 8, 9, and 10 are specifically designed to conduct sensitivity analyses based on the architectural setup of Scenario 7. In the baseline configuration, uniform weights of unity are assigned to the fuel consumption cost, carbon emission penalty, and electricity procurement expenditure. Scenario 8 assesses the impact of fuel price volatility by doubling the weight attributed to coal fuel consumption. Scenario 9 simulates more stringent environmental regulations by increasing the carbon emission weight to 2. Finally, Scenario 10 evaluates the sensitivity to market price fluctuations by adjusting the electricity purchase weight to 2. To address multi-objective energy dispatch within continuous action domains, the DDPG algorithm leverages an Actor-Critic architectural framework. Within this structure, the Actor network <inline-formula><tex-math id="M12">$$ \mu(s|\theta^\mu) $$</tex-math></inline-formula> proposes control actions, while the Critic network <inline-formula><tex-math id="M13">$$ Q(s, a | \theta^Q) $$</tex-math></inline-formula> evaluates the effectiveness of these decisions. Detailed system parameters utilized in this study are summarized in <xref ref-type="table" rid="t1">Table 1</xref>.</p>
      <fig id="fig8" position="float">
        <label>Figure 8</label>
        <caption>
          <p>Experiment result. (A) System power scheduling profile for Scenario 1; (B) System power scheduling profile for Scenario 2; (C) System power scheduling profile for Scenario 3; (D) System power scheduling profile for Scenario 4; (E) System power scheduling profile for Scenario 5; (F) System power scheduling profile for Scenario 6; (G) System power scheduling profile for Scenario 7; (H) System power scheduling profile for Scenario 8; (I) System power scheduling profile for Scenario 9; (J) System power scheduling profile for Scenario 10.</p>
        </caption>
        <graphic xlink:href="aiagent2001.fig.8.jpg"/>
      </fig>
      <table-wrap id="t1">
        <label>Table 1</label>
        <caption>
          <p>Systems parameters</p>
        </caption>
        <table frame="hsides" rules="groups">
  <tbody>
    <tr>
      <td>
        <bold>Parameter</bold>
      </td>
      <td>
        <bold>Value</bold>
      </td>
      <td>
        <bold>Parameter</bold>
      </td>
      <td>
        <bold>Value</bold>
      </td>
      <td>
        <bold>Parameter</bold>
      </td>
      <td>
        <bold>Value</bold>
      </td>
    </tr>
    <tr>
      <td>Y<italic><sub>c</sub></italic> ($/MWh)</td>
      <td>115</td>
      <td>Y<italic><sub>e</sub></italic> ($/MWh)</td>
      <td>327</td>
      <td>P<italic><sub>t</sub></italic><sub>,max</sub> (MWh)</td>
      <td>1 × 10<sup>4</sup></td>
    </tr>
    <tr>
      <td>
        <italic>a</italic>
        <sub>1</sub>
      </td>
      <td>1 × 10<sup>-5</sup></td>
      <td>
        <italic>a</italic>
        <sub>2</sub>
      </td>
      <td>2 × 10<sup>-5</sup></td>
      <td>P<italic><sub>w</sub></italic><sub>,max</sub> (MWh)</td>
      <td>2,000</td>
    </tr>
    <tr>
      <td>
        <italic>b</italic>
        <sub>1</sub>
      </td>
      <td>1 × 10<sup>-4</sup></td>
      <td>
        <italic>b</italic>
        <sub>2</sub>
      </td>
      <td>2 × 10<sup>-4</sup></td>
      <td>P<italic><sub>s</sub></italic><sub>,max</sub> (MWh)</td>
      <td>1,500</td>
    </tr>
    <tr>
      <td>
        <italic>c</italic>
        <sub>1</sub>
      </td>
      <td>0</td>
      <td>
        <italic>c</italic>
        <sub>2</sub>
      </td>
      <td>600</td>
      <td>P<italic><sub>bsc</sub></italic><sub>,max</sub> (MWh)</td>
      <td>3,000</td>
    </tr>
    <tr>
      <td>
        <italic>l</italic>
      </td>
      <td>2 × 10<sup>6</sup></td>
      <td>
        <italic>β</italic>
      </td>
      <td>0.1</td>
      <td>P<italic><sub>bc</sub></italic><sub>h,max</sub> (MWh)</td>
      <td>300</td>
    </tr>
    <tr>
      <td><italic>φ</italic> ($/ton)</td>
      <td>11.5</td>
      <td>
        <italic>k</italic>
      </td>
      <td>24</td>
      <td>P<italic><sub>bdc</sub></italic><sub>h,max</sub> (MWh)</td>
      <td>300</td>
    </tr>
    <tr>
      <td>Actor learning rate</td>
      <td>1 × 10<sup>-4</sup></td>
      <td>Critic learning rate</td>
      <td>0.001</td>
      <td>Gradient threshold</td>
      <td>1</td>
    </tr>
    <tr>
      <td>L2 Regularization</td>
      <td>1 × 10<sup>-4</sup></td>
      <td>Noise variance</td>
      <td>0.6</td>
      <td>Decay rate</td>
      <td>1 × 10<sup>-5</sup></td>
    </tr>
    <tr>
      <td>Replay buffer size</td>
      <td>1 × 10<sup>6</sup></td>
      <td>Max training episodes</td>
      <td>1 × 10<sup>4</sup></td>
      <td/>
      <td/>
    </tr>
  </tbody>
</table>
      </table-wrap>
      <sec id="sec3-1">
        <title>Performance validation</title>
        <p>A detailed analysis of the economic performance across the ten scenarios is presented in <xref ref-type="table" rid="t2">Table 2</xref>. The results systematically assess the impact of different optimization algorithms, the inclusion of a HESS, and varying cost sensitivities on the system’s operational strategy and total cost.</p>
        <table-wrap id="t2">
          <label>Table 2</label>
          <caption>
            <p>Comparative evaluation of operational costs across different scenarios</p>
          </caption>
          <table frame="hsides" rules="groups">
  <tbody>
    <tr>
      <td>
        <bold>Scenario</bold>
      </td>
      <td>
        <bold>Coal fuel cost ($) (<italic>r</italic><sub>1</sub>)</bold>
      </td>
      <td>
        <bold>Carbon emission cost <break/>($) (<italic>r</italic><sub>2</sub>)</bold>
      </td>
      <td>
        <bold>External power purchase cost ($) (<italic>r</italic><sub>3</sub>)</bold>
      </td>
      <td>
        <bold>Aggregate operational cost ($)</bold>
      </td>
    </tr>
    <tr>
      <td>1</td>
      <td>2.986 × 10<sup>7</sup></td>
      <td>2.520 × 10<sup>7</sup></td>
      <td>1.925 × 10<sup>6</sup></td>
      <td>5.696 × 10<sup>7</sup></td>
    </tr>
    <tr>
      <td>2</td>
      <td>2.789 × 10<sup>7</sup></td>
      <td>2.316 × 10<sup>7</sup></td>
      <td>5.627 × 10<sup>6</sup></td>
      <td>5.668 × 10<sup>7</sup></td>
    </tr>
    <tr>
      <td>3</td>
      <td>2.702 × 10<sup>7</sup></td>
      <td>2.335 × 10<sup>7</sup></td>
      <td>5.317 × 10<sup>6</sup></td>
      <td>5.569 × 10<sup>7</sup></td>
    </tr>
    <tr>
      <td>4</td>
      <td>2.932 × 10<sup>7</sup></td>
      <td>2.398 × 10<sup>7</sup></td>
      <td>2.159 × 10<sup>6</sup></td>
      <td>5.546 × 10<sup>7</sup></td>
    </tr>
    <tr>
      <td>5</td>
      <td>2.629 × 10<sup>7</sup></td>
      <td>2.305 × 10<sup>7</sup></td>
      <td>5.334 × 10<sup>6</sup></td>
      <td>5.467 × 10<sup>7</sup></td>
    </tr>
    <tr>
      <td>6</td>
      <td>2.823 × 10<sup>7</sup></td>
      <td>2.246 × 10<sup>7</sup></td>
      <td>2.910 × 10<sup>6</sup></td>
      <td>5.360 × 10<sup>7</sup></td>
    </tr>
    <tr>
      <td>7</td>
      <td>2.626 × 10<sup>7</sup></td>
      <td>2.287 × 10<sup>7</sup></td>
      <td>4.654 × 10<sup>6</sup></td>
      <td>5.379 × 10<sup>7</sup></td>
    </tr>
    <tr>
      <td>8</td>
      <td>2.607 × 10<sup>7</sup></td>
      <td>2.224 × 10<sup>7</sup></td>
      <td>6.376 × 10<sup>6</sup></td>
      <td>5.469 × 10<sup>7</sup></td>
    </tr>
    <tr>
      <td>9</td>
      <td>2.418 × 10<sup>7</sup></td>
      <td>2.151 × 10<sup>7</sup></td>
      <td>8.909 × 10<sup>6</sup></td>
      <td>5.460 × 10<sup>7</sup></td>
    </tr>
    <tr>
      <td>10</td>
      <td>2.965 × 10<sup>7</sup></td>
      <td>2.490 × 10<sup>7</sup></td>
      <td>4.231 × 10<sup>5</sup></td>
      <td>5.497 × 10<sup>7</sup></td>
    </tr>
  </tbody>
</table>
        </table-wrap>
        <p>Firstly, a comparison between scenarios with and without the HESS reveals its significant economic benefits. For instance, comparing Scenario 1 (SLSQP without HESS) to Scenario 2 (SLSQP with HESS), the total cost is marginally reduced. Similarly, the introduction of the HESS in the DDPG algorithm (Scenario 4 <italic>vs.</italic> Scenario 3) and the DRL algorithm (Scenario 6 <italic>vs.</italic> Scenario 5) leads to total cost reductions of 0.023 × 10<sup>7</sup> and 0.107 × 10<sup>7</sup> $ respectively. This consistently demonstrates that integrating the HESS improves energy utilization and reduces overall operational costs, regardless of the optimization algorithm used.</p>
        <p>Secondly, among the different optimization strategies, the DRL algorithm combined with HESS (Scenario 6) achieves the lowest total cost of 5.360 × 10<sup>7</sup> $. Comparing Scenario 6 with Scenario 5 explicitly demonstrates the mechanism behind this improvement: although the integration of HESS leads to a slightly higher coal fuel cost (2.823 × 10<sup>7</sup> <italic>vs.</italic> 2.629 × 10<sup>7</sup> $), this controlled increase in steady thermal generation enables the system to drastically scale back on costly external power purchases (2.910 × 10<sup>6</sup> <italic>vs.</italic> 5.334 × 10<sup>6</sup> $). Furthermore, the energy time-shifting capabilities reduce reliance on high-emission peaking periods, lowering overall carbon emission costs (2.246 × 10<sup>7</sup> <italic>vs.</italic> 2.305 × 10<sup>7</sup> $). This favorable trade-off confirms it is the most effective strategy for balancing coal fuel consumption, CO<sub>2</sub> emissions, and electricity purchasing to achieve economic optimality. Scenario 7, which introduces noise to the renewable generation data, results in a slightly higher total cost (5.379 × 10<sup>7</sup> $) than Scenario 6, reflecting the economic trade-offs required to manage system operations under uncertainty.</p>
        <p>Finally, the sensitivity analysis (Scenarios 8-10), based on the optimal framework of Scenario 7, provides critical insights into the system’s response to different cost priorities. In Scenario 8, doubling the weight of the coal fuel cost forces the system to reduce fuel consumption (from 2.626 × 10<sup>7</sup> to 2.607 × 10<sup>7</sup> $) and instead rely more on purchasing electricity (cost increased from 4.654 × 10<sup>6</sup> to 6.376 × 10<sup>6</sup> $). In Scenario 9, a higher penalty on CO<sub>2</sub> emissions prompts an even more significant shift away from thermal generation, achieving the lowest fuel (2.418 × 10<sup>7</sup> $) and emission (2.151 × 10<sup>7</sup> $) costs at the expense of having the highest electricity purchase cost (8.909 × 10<sup>6</sup> $). Conversely, in Scenario 10, when the electricity purchase cost is penalized, the system minimizes its reliance on the grid, leading to the lowest purchase cost (4.231 × 10<sup>5</sup> $) but increasing its own thermal generation, which in turn raises both fuel and emission costs. This demonstrates the model’s ability to intelligently adapt its energy dispatch strategy in response to varying economic and environmental signals.</p>
      </sec>
      <sec id="sec3-2">
        <title>Robustness verification</title>
        <p>Scenario 7 was specifically designed to validate the robustness of the optimal dispatch strategy (developed in Scenario 6) against the inherent uncertainties of renewable energy generation. To achieve this, we introduced stochastic fluctuations, modeled via an AR(1) process, to the wind and solar power data to simulate real-world forecasting errors.</p>
        <p>The results demonstrate strong performance stability. The total operational cost for Scenario 7 was 5.379 × 10<sup>7</sup> $ [<xref ref-type="table" rid="t2">Table 2</xref>], representing a marginal increase of only 0.019 × 10<sup>7</sup> $ - or approximately 0.35% - compared to the 5.360 × 10<sup>7</sup> $ cost of its deterministic counterpart, Scenario 6. This minimal cost deviation, despite the introduction of significant input noise, signifies that the DRL agent has acquired a resilient and adaptive dispatch policy. This resilience is critical for practical implementation, as it confirms the model’s capability to maintain near-optimal economic performance even when faced with imperfect environmental forecasts, thereby validating its robustness for real-world deployment.</p>
      </sec>
      <sec id="sec3-3">
        <title>Comparison of training speeds</title>
        <p>To assess the computational efficiency of the proposed algorithm, we evaluate the performance of multi-threaded parallel training. All experiments are conducted on a single workstation equipped with an 11th Gen Intel Core i7-11800H CPU (8 Cores, 16 Threads) and 32 GB of RAM. The performance metrics, summarized in <xref ref-type="table" rid="t3">Table 3</xref>, demonstrate a significant reduction in the total wall-clock training time as the number of parallel worker threads increases.</p>
        <table-wrap id="t3">
          <label>Table 3</label>
          <caption>
            <p>Parallel training performance evaluation</p>
          </caption>
          <table frame="hsides" rules="groups">
  <tbody>
    <tr>
      <td>
        <bold>Threads number</bold>
      </td>
      <td>
        <bold>Total wall clock training time (h)</bold>
      </td>
      <td>
        <bold>Speedup ratio</bold>
      </td>
    </tr>
    <tr>
      <td>1 (Standard)</td>
      <td>72.5</td>
      <td>1.00x</td>
    </tr>
    <tr>
      <td>2</td>
      <td>37.2</td>
      <td>1.95x</td>
    </tr>
    <tr>
      <td>4</td>
      <td>19.1</td>
      <td>3.80x</td>
    </tr>
    <tr>
      <td>6</td>
      <td>11.5</td>
      <td>5.50x</td>
    </tr>
  </tbody>
</table>
        </table-wrap>
        <p>The baseline single-threaded training required 72.5 h to complete. By scaling the number of workers to 2, 4, and 6, the training time was dramatically reduced to 37.2, 19.1, and 11.5 h, respectively. This corresponds to empirical speedup ratios of 1.95x, 3.80x, and 5.50x. The speedup is observed to be nearly linear up to 4 threads, indicating efficient parallelization with minimal communication overhead. Beyond this point, a slight drop-off in linear scaling efficiency is noted. The results validate that the multi-threaded training architecture effectively leverages multi-core CPUs to accelerate the learning process. The observed scalability confirms that employing a moderate number of parallel workers presents an optimal trade-off between computational resource allocation and training time reduction for this task.</p>
      </sec>
    </sec>
    <sec id="sec4">
      <title>CONCLUSIONS</title>
      <p>This paper develops a DRL-based scheduling framework for integrated wind-solar-hydrogen systems, incorporating advanced data-driven techniques for load and scenario characterization. The integration of DTW-DBSCAN and PCA-enhanced K-means clustering allows the framework to effectively navigate the inherent stochasticity of renewable generation and load demand. Experimental evaluations indicate that the proposed methodology yields an approximately 6% reduction in total annual operational expenditures compared to the baseline Scenario 1, down from 5.696 × 10<sup>7</sup> to 5.360 × 10<sup>7</sup> $. This economic performance is primarily driven by the agent’s ability to execute a sophisticated energy time-shift arbitrage strategy, effectively converting surplus renewable power into hydrogen storage. The proposed control architecture successfully transforms hydrogen-based energy storage from a passive backup component into a proactive, economically versatile asset for high-renewable grids. Furthermore, the proposed scalable distributed training architecture significantly accelerates the training speed and substantially improves data throughput, highlighting a robust computational capacity for large-scale energy dispatch optimization. Future work will focus on extending this framework to multi-agent collaborative scheduling and incorporating more complex dynamics from evolving carbon-linked energy markets. Additionally, we plan to investigate the optimization effects of employing a non-linear semi-empirical behavior model for hydrogen storage, comparing its efficacy against the current constant efficiency model in dynamic microgrid scheduling.</p>
    </sec>
  </body>
  <back>
    <sec>
      <title>DECLARATIONS</title>
      <sec>
        <title>Authors’ contributions</title>
        <p>Conceptualization, methodology, software, writing - original draft: Zhang, B.</p>
        <p>Data curation, investigation, formal analysis: Wang, C.</p>
        <p>Supervision, writing - review and editing, project administration, funding acquisition: Ma, Y.</p>
        <p>Validation, visualization: Xie, J.</p>
        <p>Supervision, resources, investigation: He, L.</p>
      </sec>
      <sec>
        <title>Availability of data and materials</title>
        <p>The data generated during the current study are available from the corresponding author on reasonable request. The publicly available input time-series data used in this study were obtained from Open Power System Data (OPSD)<sup>[<xref ref-type="bibr" rid="B45">45</xref>]</sup>, Time Series, available at URL (<uri xlink:href="https://doi.org/10.25832/time_series/2020-10-06">https://doi.org/10.25832/time_series/2020-10-06</uri>).</p>
      </sec>
      <sec>
        <title>Financial support and sponsorship</title>
        <p>The work has been supported in part by the Natural Science Foundation of Shaanxi Province, China (No. 2023-JC-QN-0687), in part by the Central University Basic Research Fund of China.</p>
      </sec>
      <sec>
        <title>AI and AI-assisted tools statement</title>
        <p>During the preparation of this manuscript, the AI tool Gemini (version 3.1, released 2026-02-19) was used solely for language editing. The tool did not influence the study design, data collection, analysis, interpretation, or the scientific content of the work. All authors take full responsibility for the accuracy, integrity, and final content of the manuscript.</p>
      </sec>
      <sec>
        <title>Conflicts of interest</title>
        <p>All authors declared that there are no conflicts of interest.</p>
      </sec>
      <sec>
        <title>Ethical approval and consent to participate</title>
        <p>Not applicable.</p>
      </sec>
      <sec>
        <title>Consent for publication</title>
        <p>Not applicable.</p>
      </sec>
      <sec>
        <title>Copyright</title>
        <p>©The Author(s) 2026.</p>
      </sec>
    </sec>
    <ref-list>
      <ref id="B1">
        <label>1</label>
        <element-citation publication-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Wang</surname>
              <given-names>C.</given-names>
            </name>
            <name>
              <surname>Ma</surname>
              <given-names>Y.</given-names>
            </name>
            <name>
              <surname>Xie</surname>
              <given-names>J.</given-names>
            </name>
            <name>
              <surname>Ouyang</surname>
              <given-names>Q.</given-names>
            </name>
          </person-group>
          <article-title>Multi-objective energy dispatch with deep reinforcement learning for wind-solar-thermal-storage hybrid systems</article-title>
          <source>J. Energy Storage</source>
          <year>2025</year>
          <volume>105</volume>
          <fpage>114635</fpage>
          <pub-id pub-id-type="doi">10.1016/j.est.2024.114635</pub-id>
        </element-citation>
      </ref>
      <ref id="B2">
        <label>2</label>
        <element-citation publication-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Hui</surname>
              <given-names>Y.</given-names>
            </name>
            <name>
              <surname>Wang</surname>
              <given-names>M.</given-names>
            </name>
            <name>
              <surname>Guo</surname>
              <given-names>S.</given-names>
            </name>
            <etal/>
          </person-group>
          <article-title>Comprehensive review of development and applications of hydrogen energy technologies in China for carbon neutrality: technology advances and challenges</article-title>
          <source>Energy Convers. Manag.</source>
          <year>2024</year>
          <volume>315</volume>
          <fpage>118776</fpage>
          <pub-id pub-id-type="doi">10.1016/j.enconman.2024.118776</pub-id>
        </element-citation>
      </ref>
      <ref id="B3">
        <label>3</label>
        <element-citation publication-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Liu</surname>
              <given-names>Y.</given-names>
            </name>
            <name>
              <surname>Xie</surname>
              <given-names>X.</given-names>
            </name>
            <name>
              <surname>Wang</surname>
              <given-names>M.</given-names>
            </name>
          </person-group>
          <article-title>Energy structure and carbon emission: analysis against the background of the current energy crisis in the EU</article-title>
          <source>Energy</source>
          <year>2023</year>
          <volume>280</volume>
          <fpage>128129</fpage>
          <pub-id pub-id-type="doi">10.1016/j.energy.2023.128129</pub-id>
        </element-citation>
      </ref>
      <ref id="B4">
        <label>4</label>
        <element-citation publication-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Mallapaty</surname>
              <given-names>S.</given-names>
            </name>
          </person-group>
          <article-title>How China could be carbon neutral by mid-century</article-title>
          <source>Nature</source>
          <year>2020</year>
          <volume>586</volume>
          <fpage>482</fpage>
          <lpage>3</lpage>
          <pub-id pub-id-type="doi">10.1038/d41586-020-02927-9</pub-id>
          <pub-id pub-id-type="pmid">33077972</pub-id>
        </element-citation>
      </ref>
      <ref id="B5">
        <label>5</label>
        <element-citation publication-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Cheng</surname>
              <given-names>Y.</given-names>
            </name>
            <name>
              <surname>Zhang</surname>
              <given-names>N.</given-names>
            </name>
            <name>
              <surname>Kirschen</surname>
              <given-names>D. S.</given-names>
            </name>
            <name>
              <surname>Huang</surname>
              <given-names>W.</given-names>
            </name>
            <name>
              <surname>Kang</surname>
              <given-names>C.</given-names>
            </name>
          </person-group>
          <article-title>Planning multiple energy systems for low-carbon districts with high penetration of renewable energy: an empirical study in China</article-title>
          <source>Appl. Energy</source>
          <year>2020</year>
          <volume>261</volume>
          <fpage>114390</fpage>
          <pub-id pub-id-type="doi">10.1016/j.apenergy.2019.114390</pub-id>
        </element-citation>
      </ref>
      <ref id="B6">
        <label>6</label>
        <element-citation publication-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Liu</surname>
              <given-names>H.</given-names>
            </name>
            <name>
              <surname>Zhu</surname>
              <given-names>H.</given-names>
            </name>
            <name>
              <surname>Li</surname>
              <given-names>F.</given-names>
            </name>
            <etal/>
          </person-group>
          <article-title>Economic operation strategy of electricity-gas-thermal-hydrogen integrated energy system taking into account the cost of carbon</article-title>
          <source>Electr. Power Constr.</source>
          <year>2021</year>
          <volume>42</volume>
          <fpage>21</fpage>
          <lpage>29</lpage>
          <comment> (in Chinese)</comment>
          <pub-id pub-id-type="doi">10.12204/j.issn.1000-7229.2021.12.003</pub-id>
        </element-citation>
      </ref>
      <ref id="B7">
        <label>7</label>
        <element-citation publication-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Zhuo</surname>
              <given-names>Z.</given-names>
            </name>
            <name>
              <surname>Du</surname>
              <given-names>E.</given-names>
            </name>
            <name>
              <surname>Zhang</surname>
              <given-names>N.</given-names>
            </name>
            <etal/>
          </person-group>
          <article-title>Cost increments of electricity supply to achieve carbon neutrality in China</article-title>
          <source>Nat. Commun.</source>
          <year>2022,</year>
          <comment> <italic>13</italic>, 3172.</comment>
          <pub-id pub-id-type="doi">10.1038/s41467-022-30747-0</pub-id>
          <pub-id pub-id-type="pmid">35676273</pub-id>
          <pub-id pub-id-type="pmcid">PMC9177843</pub-id>
        </element-citation>
      </ref>
      <ref id="B8">
        <label>8</label>
        <element-citation publication-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Jin</surname>
              <given-names>J.</given-names>
            </name>
            <name>
              <surname>Zhou</surname>
              <given-names>P.</given-names>
            </name>
            <name>
              <surname>Li</surname>
              <given-names>C.</given-names>
            </name>
            <name>
              <surname>Guo</surname>
              <given-names>X.</given-names>
            </name>
            <name>
              <surname>Zhang</surname>
              <given-names>M.</given-names>
            </name>
          </person-group>
          <article-title>Low-carbon power dispatch with wind power based on carbon trading mechanism</article-title>
          <source>Energy</source>
          <year>2019</year>
          <volume>170</volume>
          <fpage>250</fpage>
          <lpage>60</lpage>
          <pub-id pub-id-type="doi">10.1016/j.energy.2018.12.126</pub-id>
        </element-citation>
      </ref>
      <ref id="B9">
        <label>9</label>
        <element-citation publication-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Zhang</surname>
              <given-names>Y.</given-names>
            </name>
            <name>
              <surname>Li</surname>
              <given-names>J.</given-names>
            </name>
            <name>
              <surname>Ji</surname>
              <given-names>X.</given-names>
            </name>
            <name>
              <surname>Yang</surname>
              <given-names>M.</given-names>
            </name>
            <name>
              <surname>Ye</surname>
              <given-names>P.</given-names>
            </name>
          </person-group>
          <article-title>Optimal scheduling of electricity-gas-heat integrated energy system with coordination of flexibility and reliability</article-title>
          <source>Sustain. Energy Technol. Assess.</source>
          <year>2024</year>
          <volume>71</volume>
          <fpage>103968</fpage>
          <pub-id pub-id-type="doi">10.1016/j.seta.2024.103968</pub-id>
        </element-citation>
      </ref>
      <ref id="B10">
        <label>10</label>
        <element-citation publication-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Kumar</surname>
              <given-names>C. M. S.</given-names>
            </name>
            <name>
              <surname>Singh</surname>
              <given-names>S.</given-names>
            </name>
            <name>
              <surname>Gupta</surname>
              <given-names>M. K.</given-names>
            </name>
            <etal/>
          </person-group>
          <article-title>Solar energy: a promising renewable source for meeting energy demand in Indian agriculture applications</article-title>
          <source>Sustain. Energy Technol. Assess.</source>
          <year>2023</year>
          <volume>55</volume>
          <fpage>102905</fpage>
          <pub-id pub-id-type="doi">10.1016/j.seta.2022.102905</pub-id>
        </element-citation>
      </ref>
      <ref id="B11">
        <label>11</label>
        <element-citation publication-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Gangopadhyay</surname>
              <given-names>A.</given-names>
            </name>
            <name>
              <surname>Seshadri</surname>
              <given-names>A.</given-names>
            </name>
            <name>
              <surname>Sparks</surname>
              <given-names>N.</given-names>
            </name>
            <name>
              <surname>Toumi</surname>
              <given-names>R.</given-names>
            </name>
          </person-group>
          <article-title>The role of wind-solar hybrid plants in mitigating renewable energy-droughts</article-title>
          <source>Renew. Energy</source>
          <year>2022</year>
          <volume>194</volume>
          <fpage>926</fpage>
          <lpage>37</lpage>
          <pub-id pub-id-type="doi">10.1016/j.renene.2022.05.122</pub-id>
        </element-citation>
      </ref>
      <ref id="B12">
        <label>12</label>
        <element-citation publication-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Huang</surname>
              <given-names>K.</given-names>
            </name>
            <name>
              <surname>Liu</surname>
              <given-names>P.</given-names>
            </name>
            <name>
              <surname>Ming</surname>
              <given-names>B.</given-names>
            </name>
            <name>
              <surname>Kim</surname>
              <given-names>J.</given-names>
            </name>
            <name>
              <surname>Gong</surname>
              <given-names>Y.</given-names>
            </name>
          </person-group>
          <article-title>Economic operation of a wind-solar-hydro complementary system considering risks of output shortage, power curtailment and spilled water</article-title>
          <source>Appl. Energy</source>
          <year>2021</year>
          <volume>290</volume>
          <fpage>116805</fpage>
          <pub-id pub-id-type="doi">10.1016/j.apenergy.2021.116805</pub-id>
        </element-citation>
      </ref>
      <ref id="B13">
        <label>13</label>
        <element-citation publication-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Liu</surname>
              <given-names>P.</given-names>
            </name>
            <name>
              <surname>Quan</surname>
              <given-names>F.</given-names>
            </name>
            <name>
              <surname>Gao</surname>
              <given-names>Y.</given-names>
            </name>
            <name>
              <surname>Alotaibi</surname>
              <given-names>B.</given-names>
            </name>
            <name>
              <surname>Alsenani</surname>
              <given-names>T. R.</given-names>
            </name>
            <name>
              <surname>Abuhussain</surname>
              <given-names>M.</given-names>
            </name>
          </person-group>
          <article-title>Green energy forecasting using multiheaded convolutional LSTM model for sustainable life</article-title>
          <source>Sustain. Energy Technol. Assess.</source>
          <year>2024</year>
          <volume>63</volume>
          <fpage>103609</fpage>
          <pub-id pub-id-type="doi">10.1016/j.seta.2024.103609</pub-id>
        </element-citation>
      </ref>
      <ref id="B14">
        <label>14</label>
        <element-citation publication-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Shoaei</surname>
              <given-names>M.</given-names>
            </name>
            <name>
              <surname>Hajinezhad</surname>
              <given-names>A.</given-names>
            </name>
            <name>
              <surname>Moosavian</surname>
              <given-names>S. F.</given-names>
            </name>
          </person-group>
          <article-title>Design, energy, exergy, economy, and environment (4E) analysis, and multi-objective optimization of a novel integrated energy system based on solar and geothermal resources</article-title>
          <source>Energy</source>
          <year>2023</year>
          <volume>280</volume>
          <fpage>128162</fpage>
          <pub-id pub-id-type="doi">10.1016/j.energy.2023.128162</pub-id>
        </element-citation>
      </ref>
      <ref id="B15">
        <label>15</label>
        <element-citation publication-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Reddy</surname>
              <given-names>S. S.</given-names>
            </name>
          </person-group>
          <article-title>Optimal scheduling of thermal-wind-solar power system with storage</article-title>
          <source>Renew. Energy</source>
          <year>2017</year>
          <volume>101</volume>
          <fpage>1357</fpage>
          <lpage>68</lpage>
          <pub-id pub-id-type="doi">10.1016/j.renene.2016.10.022</pub-id>
        </element-citation>
      </ref>
      <ref id="B16">
        <label>16</label>
        <element-citation publication-type="conference">
          <person-group person-group-type="author">
            <name>
              <surname>Li</surname>
              <given-names>Y.</given-names>
            </name>
            <name>
              <surname>Choi</surname>
              <given-names>S. S.</given-names>
            </name>
            <name>
              <surname>Vilathgamuwa</surname>
              <given-names>D. M.</given-names>
            </name>
            <name>
              <surname>Yao</surname>
              <given-names>D. L.</given-names>
            </name>
          </person-group>
          <comment>An improved dispatchable wind turbine generator and dual-battery energy storage system to reduce battery capacity requirement. In <italic>2016 IEEE 2nd Annual Southern Power Electronics Conference (SPEC)</italic>, Auckland, New Zealand, December 5-8, 2016; IEEE: New York, NY, USA, 2016; pp 1-6</comment>
          <pub-id pub-id-type="doi">10.1109/SPEC.2016.7846143</pub-id>
        </element-citation>
      </ref>
      <ref id="B17">
        <label>17</label>
        <element-citation publication-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Sgouridis</surname>
              <given-names>S.</given-names>
            </name>
            <name>
              <surname>Carbajales-dale</surname>
              <given-names>M.</given-names>
            </name>
            <name>
              <surname>Csala</surname>
              <given-names>D.</given-names>
            </name>
            <name>
              <surname>Chiesa</surname>
              <given-names>M.</given-names>
            </name>
            <name>
              <surname>Bardi</surname>
              <given-names>U.</given-names>
            </name>
          </person-group>
          <article-title>Comparative net energy analysis of renewable electricity and carbon capture and storage</article-title>
          <source>Nat. Energy</source>
          <year>2019</year>
          <volume>4</volume>
          <fpage>456</fpage>
          <lpage>65</lpage>
          <pub-id pub-id-type="doi">10.1038/s41560-019-0365-7</pub-id>
        </element-citation>
      </ref>
      <ref id="B18">
        <label>18</label>
        <element-citation publication-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Van Leeuwen</surname>
              <given-names>R.</given-names>
            </name>
            <name>
              <surname>De Wit</surname>
              <given-names>J.</given-names>
            </name>
            <name>
              <surname>Smit</surname>
              <given-names>G.</given-names>
            </name>
          </person-group>
          <article-title>Review of urban energy transition in the Netherlands and the role of smart energy management</article-title>
          <source>Energy Convers. Manag.</source>
          <year>2017</year>
          <volume>150</volume>
          <fpage>941</fpage>
          <lpage>8</lpage>
          <pub-id pub-id-type="doi">10.1016/j.enconman.2017.05.081</pub-id>
        </element-citation>
      </ref>
      <ref id="B19">
        <label>19</label>
        <element-citation publication-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Li</surname>
              <given-names>J.</given-names>
            </name>
            <name>
              <surname>Lu</surname>
              <given-names>T.</given-names>
            </name>
            <name>
              <surname>Yi</surname>
              <given-names>X.</given-names>
            </name>
            <name>
              <surname>An</surname>
              <given-names>M.</given-names>
            </name>
            <name>
              <surname>Hao</surname>
              <given-names>R.</given-names>
            </name>
          </person-group>
          <article-title>Energy systems capacity planning under high renewable penetration considering concentrating solar power</article-title>
          <source>Sustain. Energy Technol. Assess.</source>
          <year>2024</year>
          <volume>64</volume>
          <fpage>103671</fpage>
          <pub-id pub-id-type="doi">10.1016/j.seta.2024.103671</pub-id>
        </element-citation>
      </ref>
      <ref id="B20">
        <label>20</label>
        <element-citation publication-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Özkan</surname>
              <given-names>O.</given-names>
            </name>
            <name>
              <surname>Alola</surname>
              <given-names>A. A.</given-names>
            </name>
            <name>
              <surname>Adebayo</surname>
              <given-names>T. S.</given-names>
            </name>
          </person-group>
          <article-title>Environmental benefits of nonrenewable energy efficiency and renewable energy intensity in the USA and EU: examining the role of clean technologies</article-title>
          <source>Sustain. Energy Technol. Assess.</source>
          <year>2023</year>
          <volume>58</volume>
          <fpage>103315</fpage>
          <pub-id pub-id-type="doi">10.1016/j.seta.2023.103315</pub-id>
        </element-citation>
      </ref>
      <ref id="B21">
        <label>21</label>
        <element-citation publication-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Haddad</surname>
              <given-names>A.</given-names>
            </name>
            <name>
              <surname>Ramadan</surname>
              <given-names>M.</given-names>
            </name>
            <name>
              <surname>Khaled</surname>
              <given-names>M.</given-names>
            </name>
            <name>
              <surname>Ramadan</surname>
              <given-names>H. S.</given-names>
            </name>
            <name>
              <surname>Becherif</surname>
              <given-names>M.</given-names>
            </name>
          </person-group>
          <article-title>Triple hybrid system coupling fuel cell with wind turbine and thermal solar system</article-title>
          <source>Int. J. Hydrogen Energy</source>
          <year>2020</year>
          <volume>45</volume>
          <fpage>11484</fpage>
          <lpage>91</lpage>
          <pub-id pub-id-type="doi">10.1016/j.ijhydene.2019.05.143</pub-id>
        </element-citation>
      </ref>
      <ref id="B22">
        <label>22</label>
        <element-citation publication-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Zrelli</surname>
              <given-names>M. H.</given-names>
            </name>
          </person-group>
          <article-title>Renewable energy, non-renewable energy, carbon dioxide emissions and economic growth in selected Mediterranean countries</article-title>
          <source>Environ. Econ. Policy Stud.</source>
          <year>2016</year>
          <volume>19</volume>
          <fpage>691</fpage>
          <lpage>709</lpage>
          <pub-id pub-id-type="doi">10.1007/s10018-016-0170-5</pub-id>
        </element-citation>
      </ref>
      <ref id="B23">
        <label>23</label>
        <element-citation publication-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Asumadu-sarkodie</surname>
              <given-names>S.</given-names>
            </name>
            <name>
              <surname>Owusu</surname>
              <given-names>P. A.</given-names>
            </name>
          </person-group>
          <article-title>The causal effect of carbon dioxide emissions, electricity consumption, economic growth, and industrialization in Sierra Leone</article-title>
          <source>Energ Source Part B</source>
          <year>2016</year>
          <volume>12</volume>
          <fpage>32</fpage>
          <lpage>9</lpage>
          <pub-id pub-id-type="doi">10.1080/15567249.2016.1225135</pub-id>
        </element-citation>
      </ref>
      <ref id="B24">
        <label>24</label>
        <element-citation publication-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Yao</surname>
              <given-names>T.</given-names>
            </name>
            <name>
              <surname>Yang</surname>
              <given-names>Y.</given-names>
            </name>
            <name>
              <surname>Yan</surname>
              <given-names>Y.</given-names>
            </name>
            <etal/>
          </person-group>
          <article-title>Knowledge-extractor: a self-evolving scientific framework for hydrogen energy research driven by AI agents</article-title>
          <source>AI Agent</source>
          <year>2025</year>
          <volume>1</volume>
          <fpage>7</fpage>
          <pub-id pub-id-type="doi">10.20517/aiagent.2025.04</pub-id>
        </element-citation>
      </ref>
      <ref id="B25">
        <label>25</label>
        <element-citation publication-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Adetokun</surname>
              <given-names>B. B.</given-names>
            </name>
            <name>
              <surname>Oghorada</surname>
              <given-names>O.</given-names>
            </name>
            <name>
              <surname>Abubakar</surname>
              <given-names>S. J.</given-names>
            </name>
          </person-group>
          <article-title>Superconducting magnetic energy storage systems: prospects and challenges for renewable energy applications</article-title>
          <source>J. Energy Storage</source>
          <year>2022</year>
          <volume>55</volume>
          <fpage>105663</fpage>
          <pub-id pub-id-type="doi">10.1016/j.est.2022.105663</pub-id>
        </element-citation>
      </ref>
      <ref id="B26">
        <label>26</label>
        <element-citation publication-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Jahanger</surname>
              <given-names>A.</given-names>
            </name>
            <name>
              <surname>Ozturk</surname>
              <given-names>I.</given-names>
            </name>
            <name>
              <surname>Chukwuma Onwe</surname>
              <given-names>J.</given-names>
            </name>
            <name>
              <surname>Joseph</surname>
              <given-names>T. E.</given-names>
            </name>
            <name>
              <surname>Razib Hossain</surname>
              <given-names>M.</given-names>
            </name>
          </person-group>
          <article-title>Do technology and renewable energy contribute to energy efficiency and carbon neutrality? Evidence from top ten manufacturing countries</article-title>
          <source>Sustain. Energy Technol. Assess.</source>
          <year>2023</year>
          <volume>56</volume>
          <fpage>103084</fpage>
          <pub-id pub-id-type="doi">10.1016/j.seta.2023.103084</pub-id>
        </element-citation>
      </ref>
      <ref id="B27">
        <label>27</label>
        <element-citation publication-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Xie</surname>
              <given-names>J.</given-names>
            </name>
            <name>
              <surname>Ma</surname>
              <given-names>Y.</given-names>
            </name>
            <name>
              <surname>Wang</surname>
              <given-names>C.</given-names>
            </name>
            <name>
              <surname>Wang</surname>
              <given-names>Y.</given-names>
            </name>
            <name>
              <surname>Yang</surname>
              <given-names>S.</given-names>
            </name>
            <name>
              <surname>Ouyang</surname>
              <given-names>Q.</given-names>
            </name>
          </person-group>
          <article-title>Spatio-temporal multi-head graph attention network for power forecasting of regional photovoltaic plants</article-title>
          <source>Solar Energy</source>
          <year>2026</year>
          <volume>304</volume>
          <fpage>114202</fpage>
          <pub-id pub-id-type="doi">10.1016/j.solener.2025.114202</pub-id>
        </element-citation>
      </ref>
      <ref id="B28">
        <label>28</label>
        <element-citation publication-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Fu</surname>
              <given-names>Y.</given-names>
            </name>
            <name>
              <surname>Liu</surname>
              <given-names>M.</given-names>
            </name>
          </person-group>
          <article-title>Scenario decomposition method for multi-objective stochastic dynamic economical dispatch problem</article-title>
          <source>Autom. Electr. Power Syst.</source>
          <year>2014</year>
          <volume>38</volume>
          <fpage>34</fpage>
          <lpage>40</lpage>
          <comment> (in Chinese)</comment>
          <pub-id pub-id-type="doi">10.7500/AEPS20131028006</pub-id>
        </element-citation>
      </ref>
      <ref id="B29">
        <label>29</label>
        <element-citation publication-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Wang</surname>
              <given-names>S.</given-names>
            </name>
            <name>
              <surname>Luo</surname>
              <given-names>F.</given-names>
            </name>
            <name>
              <surname>Dong</surname>
              <given-names>Z. Y.</given-names>
            </name>
            <name>
              <surname>Ranzi</surname>
              <given-names>G.</given-names>
            </name>
          </person-group>
          <article-title>Joint planning of active distribution networks considering renewable power uncertainty</article-title>
          <source>Int. J. Electr. Power Energy Syst.</source>
          <year>2019</year>
          <volume>110</volume>
          <fpage>696</fpage>
          <lpage>704</lpage>
          <pub-id pub-id-type="doi">10.1016/j.ijepes.2019.03.034</pub-id>
        </element-citation>
      </ref>
      <ref id="B30">
        <label>30</label>
        <element-citation publication-type="conference">
          <person-group person-group-type="author">
            <name>
              <surname>Zhang</surname>
              <given-names>X.</given-names>
            </name>
            <name>
              <surname>Wang</surname>
              <given-names>S.</given-names>
            </name>
            <name>
              <surname>Qian</surname>
              <given-names>Y.</given-names>
            </name>
            <etal/>
          </person-group>
          <comment>Two-level optimal allocation method of integrated energy system considering source and load uncertainty and carbon trading. In <italic>2023 IEEE 6th International Electrical and Energy Conference (CIEEC)</italic>, Hefei, China, May 12-14, 2023; IEEE: New York, NY, USA, 2023; pp 2351-6</comment>
          <pub-id pub-id-type="doi">10.1109/CIEEC58067.2023.10166764</pub-id>
        </element-citation>
      </ref>
      <ref id="B31">
        <label>31</label>
        <element-citation publication-type="journal">
          <article-title>Zhang, Juntao, Cheng, Chuntian, Shen, Jianjian, Li, G, Li, X, Zhao, Z. Short-term joint optimal operation method for high proportion renewable energy grid considering wind-solar uncertainty</article-title>
          <source>Proc. CSEE</source>
          <year>2020</year>
          <volume>40</volume>
          <fpage>5921</fpage>
          <lpage>32</lpage>
          <comment> (in Chinese)</comment>
          <pub-id pub-id-type="doi">10.13334/j.0258-8013.pcsee.191200</pub-id>
        </element-citation>
      </ref>
      <ref id="B32">
        <label>32</label>
        <element-citation publication-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Wu</surname>
              <given-names>M.</given-names>
            </name>
            <name>
              <surname>Xu</surname>
              <given-names>J.</given-names>
            </name>
            <name>
              <surname>Shi</surname>
              <given-names>Z.</given-names>
            </name>
          </person-group>
          <article-title>Low carbon economic dispatch of integrated energy system considering extended electric heating demand response</article-title>
          <source>Energy</source>
          <year>2023</year>
          <volume>278</volume>
          <fpage>127902</fpage>
          <pub-id pub-id-type="doi">10.1016/j.energy.2023.127902</pub-id>
        </element-citation>
      </ref>
      <ref id="B33">
        <label>33</label>
        <element-citation publication-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Zheng</surname>
              <given-names>L.</given-names>
            </name>
            <name>
              <surname>Li</surname>
              <given-names>Y.</given-names>
            </name>
            <name>
              <surname>Wei</surname>
              <given-names>C.</given-names>
            </name>
            <name>
              <surname>Bai</surname>
              <given-names>X.</given-names>
            </name>
          </person-group>
          <article-title>A data-driven method for operation pattern analysis of the integrated energy microgrid</article-title>
          <source>Energy Convers. Manag. X</source>
          <year>2021</year>
          <volume>11</volume>
          <fpage>100092</fpage>
          <pub-id pub-id-type="doi">10.1016/j.ecmx.2021.100092</pub-id>
        </element-citation>
      </ref>
      <ref id="B34">
        <label>34</label>
        <element-citation publication-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Salpakari</surname>
              <given-names>J.</given-names>
            </name>
            <name>
              <surname>Lund</surname>
              <given-names>P.</given-names>
            </name>
          </person-group>
          <article-title>Optimal and rule-based control strategies for energy flexibility in buildings with PV</article-title>
          <source>Appl. Energy</source>
          <year>2016</year>
          <volume>161</volume>
          <fpage>425</fpage>
          <lpage>36</lpage>
          <pub-id pub-id-type="doi">10.1016/j.apenergy.2015.10.036</pub-id>
        </element-citation>
      </ref>
      <ref id="B35">
        <label>35</label>
        <element-citation publication-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>He</surname>
              <given-names>Z.</given-names>
            </name>
            <name>
              <surname>Liu</surname>
              <given-names>C.</given-names>
            </name>
            <name>
              <surname>Wang</surname>
              <given-names>Y.</given-names>
            </name>
            <name>
              <surname>Wang</surname>
              <given-names>X.</given-names>
            </name>
            <name>
              <surname>Man</surname>
              <given-names>Y.</given-names>
            </name>
          </person-group>
          <article-title>Optimal operation of wind-solar-thermal collaborative power system considering carbon trading and energy storage</article-title>
          <source>Appl. Energy</source>
          <year>2023</year>
          <volume>352</volume>
          <fpage>121993</fpage>
          <pub-id pub-id-type="doi">10.1016/j.apenergy.2023.121993</pub-id>
        </element-citation>
      </ref>
      <ref id="B36">
        <label>36</label>
        <element-citation publication-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Yang</surname>
              <given-names>T.</given-names>
            </name>
            <name>
              <surname>Zhao</surname>
              <given-names>L.</given-names>
            </name>
            <name>
              <surname>Li</surname>
              <given-names>W.</given-names>
            </name>
            <name>
              <surname>Zomaya</surname>
              <given-names>A. Y.</given-names>
            </name>
          </person-group>
          <article-title>Dynamic energy dispatch strategy for integrated energy system based on improved deep reinforcement learning</article-title>
          <source>Energy</source>
          <year>2021</year>
          <volume>235</volume>
          <fpage>121377</fpage>
          <pub-id pub-id-type="doi">10.1016/j.energy.2021.121377</pub-id>
        </element-citation>
      </ref>
      <ref id="B37">
        <label>37</label>
        <element-citation publication-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Sanjari</surname>
              <given-names>M. J.</given-names>
            </name>
            <name>
              <surname>Karami</surname>
              <given-names>H.</given-names>
            </name>
            <name>
              <surname>Gooi</surname>
              <given-names>H. B.</given-names>
            </name>
          </person-group>
          <article-title>Analytical rule-based approach to online optimal control of smart residential energy system</article-title>
          <source>IEEE Trans. Ind. Inf.</source>
          <year>2017</year>
          <volume>13</volume>
          <fpage>1586</fpage>
          <lpage>97</lpage>
          <pub-id pub-id-type="doi">10.1109/tii.2017.2651879</pub-id>
        </element-citation>
      </ref>
      <ref id="B38">
        <label>38</label>
        <element-citation publication-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Cai</surname>
              <given-names>T.</given-names>
            </name>
            <name>
              <surname>Zhao</surname>
              <given-names>C.</given-names>
            </name>
            <name>
              <surname>Xu</surname>
              <given-names>Q.</given-names>
            </name>
          </person-group>
          <article-title>Energy network dispatch optimization under emergency of local energy shortage</article-title>
          <source>Energy</source>
          <year>2012</year>
          <volume>42</volume>
          <fpage>132</fpage>
          <lpage>45</lpage>
          <pub-id pub-id-type="doi">10.1016/j.energy.2012.04.001</pub-id>
        </element-citation>
      </ref>
      <ref id="B39">
        <label>39</label>
        <element-citation publication-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Liao</surname>
              <given-names>G.</given-names>
            </name>
          </person-group>
          <article-title>A novel evolutionary algorithm for dynamic economic dispatch with energy saving and emission reduction in power system integrated wind power</article-title>
          <source>Energy</source>
          <year>2011</year>
          <volume>36</volume>
          <fpage>1018</fpage>
          <lpage>29</lpage>
          <pub-id pub-id-type="doi">10.1016/j.energy.2010.12.006</pub-id>
        </element-citation>
      </ref>
      <ref id="B40">
        <label>40</label>
        <element-citation publication-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Tian</surname>
              <given-names>X.</given-names>
            </name>
          </person-group>
          <article-title>Energy storage complementary control method for wind‐solar storage combined power generation system under opportunity constraint</article-title>
          <source>The Journal of Engineering</source>
          <year>2023</year>
          <volume>2023</volume>
          <fpage>e12256</fpage>
          <pub-id pub-id-type="doi">10.1049/tje2.12256</pub-id>
        </element-citation>
      </ref>
      <ref id="B41">
        <label>41</label>
        <element-citation publication-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Lei</surname>
              <given-names>D.</given-names>
            </name>
            <name>
              <surname>Zhang</surname>
              <given-names>Z.</given-names>
            </name>
            <name>
              <surname>Wang</surname>
              <given-names>Z.</given-names>
            </name>
            <name>
              <surname>Zhang</surname>
              <given-names>L.</given-names>
            </name>
            <name>
              <surname>Liao</surname>
              <given-names>W.</given-names>
            </name>
          </person-group>
          <article-title>Long-term, multi-stage low-carbon planning model of electricity-gas-heat integrated energy system considering ladder-type carbon trading mechanism and CCS</article-title>
          <source>Energy</source>
          <year>2023</year>
          <volume>280</volume>
          <fpage>128113</fpage>
          <pub-id pub-id-type="doi">10.1016/j.energy.2023.128113</pub-id>
        </element-citation>
      </ref>
      <ref id="B42">
        <label>42</label>
        <element-citation publication-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Lin</surname>
              <given-names>B.</given-names>
            </name>
            <name>
              <surname>Huang</surname>
              <given-names>C.</given-names>
            </name>
          </person-group>
          <article-title>Analysis of emission reduction effects of carbon trading: Market mechanism or government intervention?</article-title>
          <source>Sustain. Prod. Consum.</source>
          <year>2022</year>
          <volume>33</volume>
          <fpage>28</fpage>
          <lpage>37</lpage>
          <pub-id pub-id-type="doi">10.1016/j.spc.2022.06.016</pub-id>
        </element-citation>
      </ref>
      <ref id="B43">
        <label>43</label>
        <element-citation publication-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Qi</surname>
              <given-names>N.</given-names>
            </name>
            <name>
              <surname>Huang</surname>
              <given-names>K.</given-names>
            </name>
            <name>
              <surname>Fan</surname>
              <given-names>Z.</given-names>
            </name>
            <name>
              <surname>Xu</surname>
              <given-names>B.</given-names>
            </name>
          </person-group>
          <article-title>Long-term energy management for microgrid with hybrid hydrogen-battery energy storage: a prediction-free coordinated optimization framework</article-title>
          <source>Appl. Energy</source>
          <year>2025</year>
          <volume>377</volume>
          <fpage>124485</fpage>
          <pub-id pub-id-type="doi">10.1016/j.apenergy.2024.124485</pub-id>
        </element-citation>
      </ref>
      <ref id="B44">
        <label>44</label>
        <element-citation publication-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Xiao</surname>
              <given-names>J.</given-names>
            </name>
            <name>
              <surname>Li</surname>
              <given-names>G.</given-names>
            </name>
            <name>
              <surname>Xie</surname>
              <given-names>L.</given-names>
            </name>
            <name>
              <surname>Wang</surname>
              <given-names>S.</given-names>
            </name>
            <name>
              <surname>Yu</surname>
              <given-names>L.</given-names>
            </name>
          </person-group>
          <article-title>Decarbonizing China's power sector by 2030 with consideration of technological progress and cross-regional power transmission</article-title>
          <source>Energy Policy</source>
          <year>2021</year>
          <volume>150</volume>
          <fpage>112150</fpage>
          <pub-id pub-id-type="doi">10.1016/j.enpol.2021.112150</pub-id>
        </element-citation>
      </ref>
      <ref id="B45">
        <label>45</label>
        <element-citation publication-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Wiese</surname>
              <given-names>F.</given-names>
            </name>
            <name>
              <surname>Schlecht</surname>
              <given-names>I.</given-names>
            </name>
            <name>
              <surname>Bunke</surname>
              <given-names>W.</given-names>
            </name>
            <etal/>
          </person-group>
          <article-title>Open Power System Data - frictionless data for electricity system modelling</article-title>
          <source>Appl. Energy</source>
          <year>2019</year>
          <volume>236</volume>
          <fpage>401</fpage>
          <lpage>9</lpage>
          <pub-id pub-id-type="doi">10.1016/j.apenergy.2018.11.097</pub-id>
        </element-citation>
      </ref>
    </ref-list>
  </back>
</article>
