Research outputs
Multi-Agent Actor-Critic Multitask Reinforcement Learning based on GTD(1) with Consensus
[2022]
Stankovic, Milos S.
Decentralized Multi-Agent Multi-Task Q-Learning with Function Approximation for POMDPs
[2024]
M. Stanković
Distributed Consensus-Based Multi-Agent Off-Policy Temporal-Difference Learning
[2021]
Stankovic, Milos S.