A New Reinforcement Learning Algorithm with Fixed Exploration for Semi-Markov Decision Processes
tarafından
 
Encapera, Angelo Michael, author.

Başlık
A New Reinforcement Learning Algorithm with Fixed Exploration for Semi-Markov Decision Processes

Yazar
Encapera, Angelo Michael, author.

ISBN
9780438115859

Yazar Ek Girişi
Encapera, Angelo Michael, author.

Fiziksel Tanımlama
1 electronic resource (52 pages)

Genel Not
Source: Masters Abstracts International, Volume: 57-06M(E).
 
Advisors: Abhijit Gosavi Committee members: David Enke; Zeyi Sun.

Özet
Artificial intelligence or machine learning techniques are currently being widely applied for solving problems within the field of data analytics. This work presents and demonstrates the use of a new machine learning algorithm for solving semi-Markov decision processes (SMDPs). SMDPs are encountered in the domain of Reinforcement Learning to solve control problems in discrete-event systems. The new algorithm developed here is called iSMART, an acronym for imaging Semi-Markov Average Reward Technique. The algorithm uses a constant exploration rate, unlike its precursor R-SMART, which required exploration decay. The major difference between R-SMART and iSMART is that the latter uses, in addition to the regular iterates of R-SMART, a set of so-called imaging iterates, which form an image of the regular iterates and allow iSMART to avoid exploration decay. The new algorithm is tested extensively on small-scale SMDPs and on large-scale problems from the domain of Total Productive Maintenance (TPM). The algorithm shows encouraging performance on all the cases studied.

Notlar
School code: 0587

Konu Başlığı
Artificial intelligence.
 
Systems science.

Tüzel Kişi Ek Girişi
Missouri University of Science and Technology. Systems Engineering.

Elektronik Erişim
http://gateway.proquest.com/openurl?url_ver=Z39.88-2004&rft_val_fmt=info:ofi/fmt:kev:mtx:dissertation&res_dat=xri:pqm&rft_dat=xri:pqdiss:10642082


Yer NumarasıDemirbaş NumarasıShelf LocationShelf LocationHolding Information
XX(688052.1)688052-1001Proquest E-Tez KoleksiyonuProquest E-Tez Koleksiyonu