Results
eNauka >
Results >
Distributed Gradient Temporal Difference Off-policy Learning With Eligibility Traces: Weak Convergence
| Title: | Distributed Gradient Temporal Difference Off-policy Learning With Eligibility Traces: Weak Convergence | Authors: | Stanković, Miloš S. |
Issue Date: | 2020 | Publication: | Proc. 21st IFAC World Congress | ISSN: | 2405-8963![]() Search Idenfier |
Publisher: | IFAC | Type: | Conference Paper | Collation: | vol. 53 br. 2 str. 1563-1568 | DOI: | 10.1016/j.ifacol.2020.12.2184 | WoS-ID: | 000652592500253 | Scopus-ID: | 2-s2.0-85104545219 | URI: | http://ezaposleni.singidunum.ac.rs/rest/sciNaucniRezultati/oai/record/1/8089 https://enauka.gov.rs/handle/123456789/330169 |
URL: | https://doi.org/10.1016%2Fj.ifacol.2020.12.2184 | Metadata source: | Migracija | M-category: | Mp. category will be shown later |
Items in eNauka are protected by copyright, with all rights reserved, unless otherwise indicated.
