DL-MPC (Deep Learning Model Predictive Control): Python Implementation

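DL-MPC (Deep Learning Model Predictive Control) is a control strategy that combines deep learning with model predictive control. A deep learning model predicts the system's future behavior, and model predictive control optimizes the control inputs against those predictions, so that complex systems can be controlled efficiently.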
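To make the idea concrete, the optimization solved at each time step can be written in a common textbook form. This is a sketch under assumptions: the exact cost used by the toolkit is not shown in this article, and penalizing the input increment $\Delta u$ (rather than $u$ itself) is one usual choice. Here $N_p$ and $N_c$ play the role of `predict_horizon` and `control_horizon`, $q_j$ and $r_j$ are the diagonal entries of `Q` and `R`, and $f_\theta$ denotes the trained network.

```latex
\begin{aligned}
\min_{\Delta u(k),\dots,\Delta u(k+N_c-1)} \quad
  & J = \sum_{j=1}^{N_p} q_j \bigl(y_{\mathrm{ref}}(k+j) - \hat{y}(k+j)\bigr)^2
      + \sum_{j=0}^{N_c-1} r_j \,\Delta u(k+j)^2 \\
\text{s.t.} \quad
  & \hat{y}(k+j) = f_\theta\bigl(\hat{y}(k+j-1), \dots,\; u(k+j-1), \dots\bigr), \\
  & u(k+j) = u(k+j-1) + \Delta u(k+j), \\
  & u_{\min} \le u(k+j) \le u_{\max}, \qquad
    \lvert \Delta u(k+j) \rvert \le \Delta u_{\max}.
\end{aligned}
```

Only the first input $u(k)$ is applied to the plant; the problem is then solved again at step $k+1$ with updated measurements.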

Deep learning model predictive control

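Reference: WangXiaoMingo/TensorDL-MPC: DL-MPC (Deep Learning Model Predictive Control) is a software toolkit built on Python and the TensorFlow framework that aims to improve the performance of traditional model predictive control (MPC) with deep learning. The toolkit provides core functions such as model training, simulation, and parameter optimization. (github.com)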

The steps for implementing DL-MPC in TensorFlow are as follows:

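Define the prediction model:
- Use a deep learning framework (such as TensorFlow or PyTorch) to define a neural network that predicts the system's future state. The network typically takes the current state and control input as inputs and outputs the predicted future state.
- In TensorFlow, the tf.keras module can be used to define and build the network.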
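As an illustration of this step, here is a minimal sketch of a one-step predictor built with tf.keras. The function name `build_predictor`, the feature layout (past outputs, past inputs, and the candidate input concatenated into one vector), and the hidden-layer size are assumptions made for the example; they are not taken from the toolkit.

```python
import numpy as np
import tensorflow as tf


def build_predictor(ly: int = 2, lu: int = 1, hidden: int = 16) -> tf.keras.Model:
    """Hypothetical one-step predictor.

    Input:  [y(k-ly+1..k), u(k-lu..k-1), u(k)]  (ly + lu + 1 features)
    Output: predicted y(k+1)
    """
    n_inputs = ly + lu + 1  # assumed layout: past outputs, past inputs, candidate input
    model = tf.keras.Sequential([
        tf.keras.Input(shape=(n_inputs,)),
        tf.keras.layers.Dense(hidden, activation="tanh"),
        tf.keras.layers.Dense(1),  # linear output for regression
    ])
    model.compile(optimizer="adam", loss="mse")
    return model


model = build_predictor()
x = np.zeros((4, 4), dtype=np.float32)  # batch of 4 feature vectors
y_next = model(x)                       # one prediction per row
```

A small fully connected network is used here only because it is the simplest regressor to show; a recurrent or convolutional model could fill the same role as long as it maps a history window to the next output.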
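Implement the MPC algorithm:
- Implement the model predictive control (MPC) algorithm, which consists of a prediction model, receding-horizon optimization, and a control-law update.
- Prediction model: use the neural network defined above to predict the state.
- Receding-horizon optimization: at each time step, run a multi-step prediction with the model and compute the optimal control input by minimizing an objective function.
- Control-law update: update the control law from the optimization result and apply it to the system.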
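The receding-horizon optimization described above can be sketched in plain NumPy. This is not the toolkit's `MPCController`: the function names, the finite-difference gradient, and the linear `toy_model` standing in for the trained network are all assumptions chosen to keep the example self-contained. The default weights, horizons, and bounds mirror the values used in the script in this article.

```python
import numpy as np


def rollout(model, y_hist, u_prev, du_seq, n_pred, u_bounds):
    """Predict n_pred outputs by feeding the model its own predictions."""
    hist, u, preds = list(y_hist), u_prev, []
    for j in range(n_pred):
        du = du_seq[j] if j < len(du_seq) else 0.0  # hold u beyond the control horizon
        u = float(np.clip(u + du, u_bounds[0], u_bounds[1]))
        y_next = model(hist, u)
        preds.append(y_next)
        hist = hist[1:] + [y_next]  # slide the output window forward
    return np.array(preds)


def cost(model, y_hist, u_prev, du_seq, y_ref, n_pred, u_bounds, q=0.1, r=0.01):
    """Weighted tracking error plus weighted input increments."""
    preds = rollout(model, y_hist, u_prev, du_seq, n_pred, u_bounds)
    return q * np.sum((y_ref - preds) ** 2) + r * np.sum(np.asarray(du_seq) ** 2)


def solve_mpc(model, y_hist, u_prev, y_ref, n_pred=4, n_ctrl=2, lr=0.1,
              iterations=100, tol=1e-6, du_max=10.0, u_bounds=(-5.0, 5.0), eps=1e-4):
    """Gradient descent on the increments, using central finite differences."""
    def objective(d):
        return cost(model, y_hist, u_prev, d, y_ref, n_pred, u_bounds)

    du = np.zeros(n_ctrl)
    j_old = objective(du)
    for _ in range(iterations):
        grad = np.zeros(n_ctrl)
        for i in range(n_ctrl):
            step = np.zeros(n_ctrl)
            step[i] = eps
            grad[i] = (objective(du + step) - objective(du - step)) / (2 * eps)
        du = np.clip(du - lr * grad, -du_max, du_max)  # enforce the delta-u bound
        j_new = objective(du)
        if abs(j_old - j_new) < tol:  # stop when the cost no longer changes
            break
        j_old = j_new
    # Only the first move is applied to the plant.
    u0 = float(np.clip(u_prev + du[0], u_bounds[0], u_bounds[1]))
    return u0, du


def toy_model(y_hist, u):
    """Hypothetical stand-in for the trained network."""
    return 0.8 * y_hist[-1] + 0.5 * u


u0, du = solve_mpc(toy_model, y_hist=[1.0, 1.2], u_prev=0.2, y_ref=10.0)
```

With a real network, automatic differentiation (for example `tf.GradientTape`) would normally replace the finite-difference gradient; the finite-difference version is used here only so the sketch runs without TensorFlow.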
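Train the neural network model:
- Train the network on historical data so that it predicts the system's future state accurately.
- In TensorFlow, the fit method of a tf.keras model can be used for training.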
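Before calling fit, the historical input/output record has to be turned into supervised samples. The sketch below shows one way to do that with sliding windows. The helper name `make_windows`, the feature order, and the synthetic first-order plant used to generate data are assumptions for illustration; the toolkit's own data pipeline is not shown in this article.

```python
import numpy as np


def make_windows(y, u, ly=2, lu=2):
    """Build supervised pairs from a recorded trajectory.

    X[n] = [y(k-ly+1..k), u(k-lu+1..k)],  Y[n] = y(k+1)
    """
    y = np.asarray(y, dtype=np.float32)
    u = np.asarray(u, dtype=np.float32)
    start = max(ly, lu) - 1  # first k with a full history window
    rows, targets = [], []
    for k in range(start, len(y) - 1):
        rows.append(np.concatenate([y[k - ly + 1:k + 1], u[k - lu + 1:k + 1]]))
        targets.append(y[k + 1])
    return np.stack(rows), np.array(targets, dtype=np.float32).reshape(-1, 1)


# Synthetic "historical data" from a hypothetical first-order plant.
rng = np.random.default_rng(0)
u = rng.uniform(-5, 5, size=200)
y = np.zeros(200)
for k in range(199):
    y[k + 1] = 0.8 * y[k] + 0.5 * u[k]

X, Y = make_windows(y, u)
```

The resulting arrays can be passed to a compiled Keras model, for example `model.fit(X, Y, epochs=50, validation_split=0.2)`, provided the model's input size matches `ly + lu`.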
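Integrate into the control system:
- Integrate the trained neural network model and the MPC algorithm into the control system.
- At each time step, use the current state and control input to predict the state with the neural network, and compute the optimal control input with the MPC algorithm.
- Update the system state and repeat the process.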
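The closed loop described in these steps can be sketched as follows. Everything here is a simplified stand-in: `plant` is a hypothetical system with a small offset the model does not know about, `model` replaces the trained network, and `choose_input` uses a brute-force search over a constant input instead of a gradient-based optimizer. The `bias` term is a simple form of online correction, assumed for illustration.

```python
import numpy as np


def plant(y_hist, u):
    """Hypothetical true system; the 0.3 offset is unknown to the model."""
    return 0.8 * y_hist[-1] + 0.5 * u + 0.3


def model(y_hist, u):
    """Stand-in for the trained network (same dynamics, no offset)."""
    return 0.8 * y_hist[-1] + 0.5 * u


def choose_input(y_hist, u_prev, y_ref, bias, horizon=4, u_bounds=(-5.0, 5.0),
                 q=0.1, r=0.01):
    """Pick the constant input that minimizes the predicted cost over the horizon."""
    best_u, best_cost = u_prev, np.inf
    for u in np.linspace(u_bounds[0], u_bounds[1], 201):
        hist, c = list(y_hist), r * (u - u_prev) ** 2
        for _ in range(horizon):
            y_next = model(hist, u) + bias  # corrected multi-step prediction
            c += q * (y_ref - y_next) ** 2
            hist = hist[1:] + [y_next]
        if c < best_cost:
            best_u, best_cost = float(u), c
    return best_u


y_hist, u_prev, bias = [1.0, 1.2], 0.2, 0.0
log = []  # rows of (step, u, measured y)
for k in range(2, 150):
    y_ref = 5.0 if 30 < k <= 70 else 10.0  # step changes in the reference
    u = choose_input(y_hist, u_prev, y_ref, bias)
    y_meas = plant(y_hist, u)              # apply the input to the plant
    bias = y_meas - model(y_hist, u)       # online correction from the latest model error
    y_hist = y_hist[1:] + [y_meas]         # shift the history window
    u_prev = u
    log.append((k, u, y_meas))
```

The loop shows the three recurring actions: optimize, apply the first input, then refresh the state and the correction term from the new measurement.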

Code:


''' test bp-mpc'''
import os

import matplotlib as mpl
import matplotlib.pyplot as plt
import numpy as np
import pandas as pd
import tensorflow as tf
from tensorflow.keras.models import load_model

from src.dlmpc import SimSISO
from src.dlmpc import MPCController
from src.dlmpc import calculate_performance_metrics
from src.dlmpc import optimizer
from src.dlmpc import OnlineOptimize

if __name__ == '__main__':
    mpl.use('TkAgg')
    os.environ['CUDA_VISIBLE_DEVICES'] = '-1'  # hide GPUs so TensorFlow runs on the CPU
    plt.tick_params(labelsize=12)

    # Step 1: parameter settings
    # NN parameter settings: window sizes
    input_window_dy = 2
    input_window_du = 2
    ly = input_window_dy      # length of the y history
    lu = input_window_du - 1  # length of the u history
    dim = ly + lu

    # MPC parameter settings
    # Train_NN = True
    Train_NN = False
    run_mpc = True       # renamed from `mpc` so the flag is not overwritten inside the loop
    predict_horizon = 4  # 10  # prediction horizon  (2,1), (4,2 *), (3,3)
    control_horizon = 2  # 5   # control horizon
    dim_u = 1            # control variable dimension = 1
    du_bounds = 10       # control variable constraints (delta u)
    u_bounds = [-5, 5]   # control variable constraints (u)
    opt = optimizer(optimizer_name='sgd', learning_rate=0.1, du_bound=None, exponential_decay=False)
    error = 0

    # Weight matrices
    Q = np.eye(predict_horizon) * 0.1   # tracking-error weight matrix
    R = np.eye(control_horizon) * 0.01  # control-input weight matrix
    N = 150     # number of simulation steps
    y_ref = 10  # reference value

    # Initial system state (can be customized):
    #   initial_state = np.array([1, 1.2])    # y[0], y[1]
    #   initial_input = np.array([0.1, 0.2])  # u[0], u[1]
    state_y = tf.constant([[1], [1.2]], dtype=tf.float32)
    state_u = tf.constant([[0.1]], dtype=tf.float32)
    u0 = tf.constant([0.2], dtype=tf.float32)

    # Step 2: load the plant (simulation)
    simulation = SimSISO(noise_amplitude=0)

    # Step 3: load the NN model, training it first if requested
    if Train_NN:
        script_path = 'test_models_bp_regressor.py'
        os.system(f'python {script_path}')
    model = load_model('models_save/BP_predictor.h5')
    # model.summary()

    # Step 4: MPC control
    if run_mpc:
        # Create the MPC controller instance
        mpc_controller = MPCController(model, predict_horizon, control_horizon, Q, R, ly, lu,
                                       dim_u, [-du_bounds, du_bounds], u_bounds, opt)
        # One row per step index; rows 0 and 1 (the initial samples) stay zero
        data = np.zeros((N, 7))
        result = pd.DataFrame(data, columns=['Time', 'reference', 'System output', 'u',
                                             'solving_time', 'epoch', 'error'])
        # Initialize the figure
        plt.close()
        fig, ax = plt.subplots(3, 1)
        plt.ion()  # turn on interactive mode

        # MPC control loop
        for i in range(2, N):
            if i > 30:
                y_ref = 5   # reference value
            if i > 70:
                y_ref = 10  # reference value
            mpc = OnlineOptimize(mpc_controller, state_y, state_u, use_online_correction=True)
            # Controller computation
            parameter = mpc.make_step(error, y_ref, iterations=100, tol=1e-6)
            u0 = parameter['u0']
            # System output
            plant_output = simulation.plant(np.append(tf.squeeze(state_u), parameter['u0']),
                                            tf.squeeze(state_y))
            # State estimation
            state_y, state_u, error = mpc.estimate(parameter['u0'], plant_output)
            print(f">>> Current Time: {i},\t Object J: {parameter['solving_J']:>=.4f}, \t Current u:{parameter['u0'][0]:>=.4f}, \t Current System output: {plant_output[0]:>=.4f}, \t Optimization epoch: {parameter['solving_epoch']}, \t Solving time:{parameter['solving_time']:>=.4f} s")
            result.at[i, 'Time'] = i
            result.at[i, 'reference'] = y_ref
            result.at[i, 'System output'] = plant_output
            result.at[i, 'u'] = parameter['u0']
            result.at[i, 'epoch'] = parameter['solving_epoch']
            result.at[i, 'solving_time'] = parameter['solving_time']
            result.at[i, 'error'] = y_ref - plant_output

            # Update the plots; the slice ends at i + 1 so the current step is included
            for a in ax:
                a.clear()
            ax[0].plot(result['reference'][:i + 1], '-', label='reference')
            ax[0].plot(result['System output'][:i + 1], '--', label='System output')
            ax[0].legend(loc='upper right')
            ax[1].plot(result['u'][:i + 1], '--', label='u')
            ax[1].legend(loc='upper right')
            ax[2].plot(result['error'][:i + 1], '--', label='error')
            ax[2].legend(loc='upper right')
            ax[0].set_ylabel('y')
            ax[0].set_xlabel('Time')
            ax[1].set_ylabel('u')
            ax[1].set_xlabel('Time')
            ax[2].set_ylabel('error')
            ax[2].set_xlabel('Time')
            # Set the font for tick labels to Times New Roman
            labels = (ax[0].get_xticklabels() + ax[1].get_xticklabels() + ax[2].get_xticklabels()
                      + ax[0].get_yticklabels() + ax[1].get_yticklabels() + ax[2].get_yticklabels())
            for label in labels:
                label.set_fontname('Times New Roman')
            # Adjust the layout
            plt.tight_layout()
            plt.draw()       # redraw
            plt.pause(0.01)  # pause briefly so the figure can update

        # Close the live figure once the control loop has finished
        plt.ioff()
        plt.close(fig)
        # plt.figure()
        plt.plot(result['reference'], '-', label='reference')
        plt.plot(result['System output'], '--', label='System output')
        plt.legend(loc='upper right')
        plt.show(block=False)
        plt.close()

        time_steps = np.array(result['Time'])
        setpoint = np.array(result['reference'])
        sys_out = np.array(result['System output'])
        performance_metrics = calculate_performance_metrics(
            [f'{model.name}_MPC'], sys_out, setpoint, time_steps, percent_threshold=0.02,
            ise=True, iae=True, overshoot=False, peak_time=False, rise_time=False,
            rise_time1=False, settling_time=False, steady_state_error=False)
        # print(result)
        plt.figure()
        plt.plot(np.array(result['u']), label='input_u')
        plt.show()
        performance_metrics['mean_epoch'] = result['epoch'].mean()
        performance_metrics['mean_solving_time'] = result['solving_time'].mean()
        performance_metrics['mean_error'] = result['error'].mean()
        print(performance_metrics)
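
The script requests the ISE and IAE metrics from `calculate_performance_metrics` (`ise=True`, `iae=True`). The internals of that function are not shown in this article, so the following is only a sketch of the textbook definitions, integral of squared error and integral of absolute error, using trapezoidal integration. The function name `ise_iae` is hypothetical.

```python
import numpy as np


def ise_iae(setpoint, output, time):
    """Integral of squared error and integral of absolute error (trapezoidal rule)."""
    setpoint = np.asarray(setpoint, dtype=float)
    output = np.asarray(output, dtype=float)
    time = np.asarray(time, dtype=float)
    err = setpoint - output
    dt = np.diff(time)

    def integrate(f):
        # Trapezoid: average of neighbouring samples times the step width
        return float(np.sum(0.5 * (f[1:] + f[:-1]) * dt))

    return integrate(err ** 2), integrate(np.abs(err))


# Constant error of 2 over 10 time units
t = np.arange(11)
ise, iae = ise_iae(np.full(11, 10.0), np.full(11, 8.0), t)
```

ISE penalizes large deviations more heavily, while IAE weights all deviations linearly, which is why the two are often reported together.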
