Reinforcement Learning » Growing Science

To reduce maximum tardiness by Seru Production: model, cooperative algorithm combining reinforcement learning and insights Pages 65-82

Authors: Guanghui Fu, Yang Yu, Wei Sun, Ikou Kaku

DOI: 10.5267/j.ijiec.2022.10.002

Keywords: Cooperative algorithm, Reinforcement learning, Maximum tardiness, Seru production

Abstract:

The maximum tardiness reflects the worst level of service associated with customer needs; thus, the principle by which seru production reduces the maximum tardiness is investigated, and a model to minimize the maximum tardiness of the seru production system is established. To obtain the exact solution, the non-linear seru production model that minimizes the maximum tardiness is split into a seru formation model and a linear seru scheduling model. We propose an efficient cooperative algorithm using a genetic algorithm (GA) and an innovative reinforcement learning algorithm (CAGARL) for large-scale problems. Specifically, the GA is designed for the seru formation problem, and the QL-seru algorithm (QLSA) is designed for the seru scheduling problem by combining the features of meta-heuristics and reinforcement learning. In the QLSA, we design an innovative QL-seru table and two state trimming rules to save computational time. In extensive experiments, CAGARL improved on the previous algorithm by an average of 56.6%. Finally, several managerial insights on reducing maximum tardiness are proposed.
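As a small illustration of the objective named in this abstract, the sketch below computes the maximum tardiness of a job sequence processed one job at a time, as in a single seru. The `(processing_time, due_date)` job representation and the function names are assumptions made for illustration and do not come from the paper. The earliest-due-date ordering shown is the classical single-machine rule for this objective, not the paper's CAGARL algorithm.

```python
from typing import List, Sequence, Tuple

# Hypothetical job representation: (processing_time, due_date).
Job = Tuple[float, float]


def max_tardiness(sequence: Sequence[Job]) -> float:
    """Return max_j max(0, C_j - d_j) for jobs processed in the given order."""
    clock = 0.0   # completion time of the job processed most recently
    worst = 0.0   # starting at 0 enforces the max(0, ...) part of tardiness
    for processing_time, due_date in sequence:
        clock += processing_time
        worst = max(worst, clock - due_date)
    return worst


def edd_order(jobs: Sequence[Job]) -> List[Job]:
    """Sort jobs by earliest due date (classical single-machine rule)."""
    return sorted(jobs, key=lambda job: job[1])


jobs = [(4.0, 7.0), (2.0, 6.0), (3.0, 4.0)]
print(max_tardiness(jobs))             # tardiness of the given order
print(max_tardiness(edd_order(jobs)))  # tardiness after due-date sorting
```

In a multi-seru setting, the same function could be applied to each seru's sequence and the largest value taken, which is the quantity a formation-plus-scheduling method would try to reduce.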

Simulation and modeling of human decision-making process through reinforcement learning based computational model involving past experiences Pages 366-378

Authors: Nimisha Gupta, Mitul Kumar Ahirwal, Mithilesh Atulkar

DOI: 10.5267/j.dsl.2022.9.001

Keywords: Past experiences, Decision-Making, Reinforcement Learning, Learning rules, Iowa Gambling Task

Abstract:

Experience plays a vital role in the decision-making (DM) process. In this paper, the effect of past experience on DM is simulated, modeled, and analyzed using the Iowa gambling task (IGT). The human DM process is complex and difficult to model computationally because it is subjective and varies from person to person. This study therefore attempts to simulate a DM model similar to the human DM process. To this end, real data were collected and provided as input to eight Reinforcement Learning (RL) models developed for the study. The results show that the model based on Prospect Utility (PU), learned with the Decay Reinforcement Rule (DRI) and Trial Dependency Choice (TDC), performs better than the other models. Analysis of the data, validated by the simulation and model outputs, shows that the experienced group performs better than the inexperienced group.
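The abstract does not give the equations behind the three components of the best-performing model, so the sketch below assumes the formulations commonly used in the IGT modeling literature: a power-law prospect utility with loss aversion, a decay rule that discounts all deck expectancies and adds the utility to the chosen deck, and a trial-dependent softmax whose sensitivity is `(trial / 10) ** c`. All function names, parameter names, and default values are hypothetical.

```python
import math
from typing import List


def prospect_utility(net_outcome: float, alpha: float = 0.5, lam: float = 2.0) -> float:
    """Assumed prospect utility: x**alpha for gains, -lam * |x|**alpha for losses."""
    if net_outcome >= 0:
        return net_outcome ** alpha
    return -lam * (abs(net_outcome) ** alpha)


def decay_update(expectancies: List[float], chosen: int, utility: float,
                 decay: float = 0.8) -> List[float]:
    """Assumed decay rule: every expectancy decays; the chosen deck gains the utility."""
    return [decay * e + (utility if deck == chosen else 0.0)
            for deck, e in enumerate(expectancies)]


def choice_probabilities(expectancies: List[float], trial: int, c: float = 0.5) -> List[float]:
    """Assumed trial-dependent softmax with sensitivity theta = (trial / 10) ** c."""
    theta = (trial / 10.0) ** c
    scaled = [theta * e for e in expectancies]
    shift = max(scaled)  # subtract the maximum for numerical stability
    weights = [math.exp(s - shift) for s in scaled]
    total = sum(weights)
    return [w / total for w in weights]


# One simulated trial on a four-deck task: deck 2 is chosen and pays a net +100.
expectancies = [0.0, 0.0, 0.0, 0.0]
expectancies = decay_update(expectancies, chosen=2, utility=prospect_utility(100.0))
print(choice_probabilities(expectancies, trial=2))
```

Fitting such a model to a participant would mean choosing `alpha`, `lam`, `decay`, and `c` so that the predicted choice probabilities match the observed deck choices; that fitting step is not shown here.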

Solving blocking flowshop scheduling problem with makespan criterion using q-learning-based iterated greedy algorithms Pages 85-100

Authors: M. Fatih Tasgetiren, Damla Kizilay, Levent Kandiller

DOI: 10.5267/j.jpm.2024.2.002

Keywords: Q-learning-based iterated greedy algorithms, Reinforcement learning, Blocking flowshop scheduling problem

Abstract:

This study proposes Q-learning-based iterated greedy (IGQ) algorithms to solve the blocking flowshop scheduling problem with the makespan criterion. Q-learning is a model-free machine intelligence technique, which is adapted into the traditional iterated greedy (IG) algorithm to determine its parameters, mainly the destruction size and temperature scale factor, adaptively during the search process. Besides IGQ algorithms, two different mathematical modeling techniques are presented. One of these techniques is the constraint programming (CP) model, which is known to work well with scheduling problems. The other technique is the mixed integer linear programming (MILP) model, which provides the mathematical definition of the problem. The introduction of these mathematical models supports the validation of IGQ algorithms and provides a comparison between different exact solution methodologies. To measure and compare the performance of IGQ algorithms and mathematical models, extensive computational experiments have been performed on both small and large VRF benchmarks available in the literature. Computational results and statistical analyses indicate that IGQ algorithms generate substantially better results when compared to non-learning IG algorithms.
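The idea of letting Q-learning pick an iterated greedy parameter during the search can be sketched as follows. This is a minimal single-state illustration and not the authors' IGQ design: the class name `QParameterController`, the candidate destruction sizes, the reward definition, and the hyperparameter defaults are all assumptions. The abstract does not describe the state space, reward, or update schedule used in the paper.

```python
import random
from typing import Dict, Iterable


class QParameterController:
    """Hypothetical single-state Q-learning controller over candidate parameter values."""

    def __init__(self, actions: Iterable[int], learning_rate: float = 0.1,
                 discount: float = 0.9, epsilon: float = 0.1, seed: int = 0) -> None:
        self.actions = list(actions)
        self.q: Dict[int, float] = {a: 0.0 for a in self.actions}
        self.learning_rate = learning_rate
        self.discount = discount
        self.epsilon = epsilon
        self.rng = random.Random(seed)

    def select(self) -> int:
        """Epsilon-greedy choice: explore with probability epsilon, else take the best Q."""
        if self.rng.random() < self.epsilon:
            return self.rng.choice(self.actions)
        return max(self.actions, key=lambda a: self.q[a])

    def update(self, action: int, reward: float) -> None:
        """Standard Q-learning update; with one state, the next-state value is max Q."""
        best_next = max(self.q.values())
        target = reward + self.discount * best_next
        self.q[action] += self.learning_rate * (target - self.q[action])


# Usage inside a (not shown) iterated greedy loop: choose a destruction size,
# run one destruction/construction step, then reward the choice if the
# makespan improved. The reward below is a stand-in for that feedback.
controller = QParameterController(actions=[2, 3, 4, 5])
for _ in range(20):
    destruction_size = controller.select()
    reward = 1.0 if destruction_size == 3 else 0.0  # placeholder feedback
    controller.update(destruction_size, reward)
print(controller.q)
```

The same controller shape could be reused for a second parameter, such as a temperature scale factor, by creating another instance with its own candidate values.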
