邓天虎: Data-driven Convex Policy Optimization in an Assemble-to-order System

报告时间：2023年10月13日（星期五）14:30-15:30

报告地点：管理学院新大楼第二学术报告厅

报告人：邓天虎博士

工作单位：清华大学

举办单位：管理学院

报告简介：

This paper investigates the optimization of periodic-review assemble-to-order (ATO) production systems with multiple products assembled from multiple components, under the data-driven setting where only historical demand data is available and demand distributions are unknown. To address this challenge, we propose a semi-model-based fitted Q iteration (S-FQI) algorithm framework that leverages the known transition dynamics. We provide a proof of the statistical convergence rate of the proposed algorithm concerning the number of iterations, the number of demand samples, and the number of generated trajectories.

Additionally, we introduce the convex-TD3 (CTD3) algorithm to tackle practical challenges by incorporating the convex property of ATO systems and utilizing an input convex neural network (ICNN) to improve efficiency and effectiveness.

报告人简介：

邓天虎，邓天虎（博士，副教授）目前就职于清华大学工业工程系。2013年于美国加州大学伯克利分校获得工业工程与运筹博士学位，2008年于清华大学工业工程系获得学士学位。目前研究方向侧重智慧供应链。以第一作者和通讯作者在Manufacturing & Service Operations Management、Operations Research等国际学术期刊和学术会议发表论文20余篇。

1	智能计算与工业软件前沿技术研讨会报告九则
2	陈鑫: 原位钛同位素示踪岩浆-热液演化及金属富集成矿过程
3	申广君: Least squares estimation for path-distribution dependent SDEs driven by fractional Brownian motions
4	曾现来: 低碳转型背景下循环经济的挑战与机遇
5	Elakneswaran: 放射性废物固定化的地质聚合物技术——可持续核修复的途径
6	René Michael Koenigs: Photochemistry as a tool for reaction discovery with reactive intermediates
7	Dominik Grall: Lessons learned from grid restoration and islanded operation tests with hydropower plants
8	Wendelin Angermann: Analytical Optimization and Practical Verification of Reactive Power Supply
9	Alexander Fröhlich: Transformers Under DC Bias—Investigating GIC Effects and Mitigation Strategies
10	陶加华: 一维硒化锑薄膜的取向生长与缺陷钝化机制

邓天虎: Data-driven Convex Policy Optimization in an Assemble-to-order System
发布日期：2023-10-11 字号：大 中 小【打印】

点击排行榜

邓天虎: Data-driven Convex Policy Optimization in an Assemble-to-order System 发布日期：2023-10-11 字号：大 中 小 【打印】

点击排行榜

邓天虎: Data-driven Convex Policy Optimization in an Assemble-to-order System
发布日期：2023-10-11 字号：大中小【打印】