Bruno Castro da Silva
Dealing with non-stationary environments using context detection
BC da Silva, EW Basso, ALC Bazzan, PM Engel
International Conference on Machine Learning (ICML 2006), 217-224, 2006
Learning parameterized skills
BC da Silva, G Konidaris, A Barto
International Conference on Machine Learning (ICML 2012), 2012
Preventing undesirable behavior of intelligent machines
PS Thomas, B Castro da Silva, AG Barto, S Giguere, Y Brun, E Brunskill
Science 366 (6468), 999-1004, 2019
Learning in groups of traffic signals
ALC Bazzan, D De Oliveira, BC da Silva
Engineering Applications of Artificial Intelligence 23 (4), 560-568, 2010
Gaussian Processes for Learning and Control: A Tutorial with Examples
M Liu, G Chowdhary, BC Da Silva, SY Liu, JP How
IEEE Control Systems Magazine 38 (5), 53-86, 2018
Reinforcement Learning based Control of Traffic Lights in Non-stationary Environments: A Case Study in a Microscopic Simulator
D de Oliveira, ALC Bazzan, BC da Silva, EW Basso, L Nunes, R Rossetti, ...
4th European Workshop on Multi-Agent Systems (EUMAS 2006), 2006
ITSUMO: an intelligent transportation system for urban mobility
BC Da Silva, R Junges, D de Oliveira, ALC Bazzan
[Demonstration Track] (AAMAS 2006) - Proceedings of the 5th International …, 2006
A task-and-technique centered survey on visual analytics for deep learning model engineering
R Garcia, AC Telea, BC da Silva, J Tørresen, JLD Comba
Computers & Graphics 77, 30-49, 2018
Learning parameterized motor skills on a humanoid robot
BC Da Silva, G Baldassarre, G Konidaris, A Barto
IEEE International Conference on Robotics and Automation (ICRA 2014), 5239-5244, 2014
Universal off-policy evaluation
Y Chandak, S Niekum, B da Silva, E Learned-Miller, E Brunskill, ...
Advances in Neural Information Processing Systems (NeurIPS 2021) 34, 27475-27490, 2021
Analysing the impact of travel information for minimising the regret of route choice
GO Ramos, ALC Bazzan, BC da Silva
Transportation Research Part C: Emerging Technologies 88, 257-271, 2018
Fairness Guarantees under Demographic Shift
S Giguere, B Metevier, BC da Silva, Y Brun, PS Thomas, S Niekum
International Conference on Learning Representations (ICLR 2022), 2022
Adaptive traffic control with reinforcement learning
B da Silva, D Oliveira, AL Bazzan, EW Basso
4th Workshop on Agents in Traffic and Transportation (ATT@AAMAS 2006), 80-86, 2006
Optimistic linear support and successor features as a basis for optimal policy transfer
LN Alegre, A Bazzan, BC Da Silva
International Conference on Machine Learning (ICML 2022), 394-413, 2022
Improving reinforcement learning with context detection
BC Da Silva, EW Basso, FS Perotto, AL C Bazzan, PM Engel
(AAMAS 2006) Intl. Joint Conference on Autonomous Agents and Multiagent …, 2006
Active learning of parameterized skills
B Da Silva, G Konidaris, A Barto
International Conference on Machine Learning (ICML 2014), 1737-1745, 2014
Autonomous Reinforcement Learning of Multiple Interrelated Tasks
VG Santucci, E Cartoni, BC da Silva, G Baldassarre
International Conference on Development and Learning (ICDL 2019), 2019
Energetic natural gradient descent
P Thomas, BC Silva, C Dann, E Brunskill
International Conference on Machine Learning (ICML 2016), 2887-2895, 2016
MO-Gym: A Library of Multi-Objective Reinforcement Learning Environments
LN Alegre, F Felten, EG Talbi, G Danoy, A Nowé, ALC Bazzan, ...
Proceedings of the 34th Benelux Conference on Artificial Intelligence BNAIC …, 2022
Minimum-Delay Adaptation in Non-Stationary Reinforcement Learning via Online High-Confidence Change-Point Detection
LN Alegre, ALC Bazzan, BC da Silva
(AAMAS 2021) Intl. Conference on Autonomous Agents and Multiagent Systems …, 2021