Encapera, Angelo Michael, author.

A New Reinforcement Learning Algorithm with Fixed Exploration for Semi-Markov Decision Processes

9780438115859

1 electronic resource (52 pages)

Source: Masters Abstracts International, Volume: 57-06M(E).

Advisors: Abhijit Gosavi. Committee members: David Enke; Zeyi Sun.

Artificial intelligence or machine learning techniques are currently being widely applied for solving problems within the field of data analytics. This work presents and demonstrates the use of a new machine learning algorithm for solving semi-Markov decision processes (SMDPs). SMDPs are encountered in the domain of Reinforcement Learning to solve control problems in discrete-event systems. The new algorithm developed here is called iSMART, an acronym for imaging Semi-Markov Average Reward Technique. The algorithm uses a constant exploration rate, unlike its precursor R-SMART, which required exploration decay. The major difference between R-SMART and iSMART is that the latter uses, in addition to the regular iterates of R-SMART, a set of so-called imaging iterates, which form an image of the regular iterates and allow iSMART to avoid exploration decay. The new algorithm is tested extensively on small-scale SMDPs and on large-scale problems from the domain of Total Productive Maintenance (TPM). The algorithm shows encouraging performance on all the cases studied.

School code: 0587

Artificial intelligence.

Systems science.

Missouri University of Science and Technology. Systems Engineering.

http://gateway.proquest.com/openurl?url_ver=Z39.88-2004&rft_val_fmt=info:ofi/fmt:kev:mtx:dissertation&res_dat=xri:pqm&rft_dat=xri:pqdiss:10642082