Search




Research outputs

Multi-Agent Actor-Critic Multitask Reinforcement Learning based on GTD(1) with Consensus   [2022]

Stankovic, Milos S.  ; Beko, Marko  ; Ilic, Nemanja  ; Stankovic, Srdjan S.

Decentralized Multi-Agent Multi-Task Q-Learning with Function Approximation for POMDPs   [2024]

M. Stanković  ; M. Beko  ; S. Stanković

Distributed Consensus-Based Multi-Agent Off-Policy Temporal-Difference Learning   [2021]

Stankovic, Milos S.  ; Beko, Marko  ; Stankovic, Srdjan S.

Filters

By type