Sangam: A Confluence of Knowledge Streams

Proximal algorithms and temporal difference methods for solving fixed point problems

Show simple item record

dc.creator Bertsekas, Dimitri P
dc.date 2021-09-20T17:30:42Z
dc.date 2021-09-20T17:30:42Z
dc.date 2018-03-02
dc.date 2020-09-24T21:34:30Z
dc.date.accessioned 2023-03-01T18:06:35Z
dc.date.available 2023-03-01T18:06:35Z
dc.identifier https://hdl.handle.net/1721.1/131865
dc.identifier.uri http://localhost:8080/xmlui/handle/CUHPOERS/278781
dc.description Abstract In this paper we consider large fixed point problems and solution with proximal algorithms. We show that for linear problems there is a close connection between proximal iterations, which are prominent in numerical analysis and optimization, and multistep methods of the temporal difference type such as TD( $$\lambda $$ λ ), LSTD( $$\lambda $$ λ ), and LSPE( $$\lambda $$ λ ), which are central in simulation-based exact and approximate dynamic programming. One benefit of this connection is a new and simple way to accelerate the standard proximal algorithm by extrapolation towards a multistep iteration, which generically has a faster convergence rate. Another benefit is the potential for integration into the proximal algorithmic context of several new ideas that have emerged in the approximate dynamic programming context, including simulation-based implementations. Conversely, the analytical and algorithmic insights from proximal algorithms can be brought to bear on the analysis and the enhancement of temporal difference methods. We also generalize our linear case result to nonlinear problems that involve a contractive mapping, thus providing guaranteed and potentially substantial acceleration of the proximal and forward backward splitting algorithms at no extra cost. Moreover, under certain monotonicity assumptions, we extend the connection with temporal difference methods to nonlinear problems through a linearization approach.
dc.format application/pdf
dc.language en
dc.publisher Springer US
dc.relation https://doi.org/10.1007/s10589-018-9990-5
dc.rights Creative Commons Attribution-Noncommercial-Share Alike
dc.rights http://creativecommons.org/licenses/by-nc-sa/4.0/
dc.rights Springer Science+Business Media, LLC, part of Springer Nature
dc.source Springer US
dc.title Proximal algorithms and temporal difference methods for solving fixed point problems
dc.type Article
dc.type http://purl.org/eprint/type/JournalArticle


Files in this item

Files Size Format View
10589_2018_9990_ReferencePDF.pdf 3.055Mb application/pdf View/Open

This item appears in the following Collection(s)

Show simple item record

Search DSpace


Advanced Search

Browse