Follow
Thomas Mesnard
Thomas Mesnard
Research Scientist at Google DeepMind
Verified email at google.com - Homepage
Title
Cited by
Cited by
Year
Towards biologically plausible deep learning
Y Bengio, DH Lee, J Bornschein, T Mesnard, Z Lin
arXiv preprint arXiv:1502.04156, 2015
4182015
An objective function for STDP
Y Bengio, T Mesnard, A Fischer, S Zhang, Y Wu
arXiv preprint arXiv:1509.05936 5 (6.2), 6.3, 2015
175*2015
Rlaif: Scaling reinforcement learning from human feedback with ai feedback
H Lee, S Phatale, H Mansoor, K Lu, T Mesnard, C Bishop, V Carbune, ...
arXiv preprint arXiv:2309.00267, 2023
1372023
Hindsight credit assignment
A Harutyunyan, W Dabney, T Mesnard, M Gheshlaghi Azar, B Piot, ...
Advances in neural information processing systems 32, 2019
852019
Counterfactual credit assignment in model-free reinforcement learning
T Mesnard, T Weber, F Viola, S Thakoor, A Saade, A Harutyunyan, ...
arXiv preprint arXiv:2011.09464, 2020
612020
Generalization of equilibrium propagation to vector field dynamics
B Scellier, A Goyal, J Binas, T Mesnard, Y Bengio
arXiv preprint arXiv:1808.04873, 2018
46*2018
Geometric entropic exploration
ZD Guo, MG Azar, A Saade, S Thakoor, B Piot, BA Pires, M Valko, ...
arXiv preprint arXiv:2101.02055, 2021
372021
Towards deep learning with spiking neurons in energy based models with contrastive hebbian plasticity
T Mesnard, W Gerstner, J Brea
arXiv preprint arXiv:1612.03214, 2016
272016
Nash learning from human feedback
R Munos, M Valko, D Calandriello, MG Azar, M Rowland, ZD Guo, Y Tang, ...
arXiv preprint arXiv:2312.00886, 2023
102023
Curiosity in hindsight: intrinsic exploration in stochastic environments
D Jarrett, C Tallec, F Altché, T Mesnard, R Munos, M Valko
82023
Ghost units yield biologically plausible backprop in deep neural networks
T Mesnard, G Vignoud, J Sacramento, W Senn, Y Bengio
arXiv preprint arXiv:1911.08585, 2019
52019
From STDP towards Biologically Plausible Deep Learning
Y Bengio, A Fischer, T Mesnard, S Zhang, Y Wu
ICML 2015, Deep Learning Workshop, 2015
52015
Direct language model alignment from online ai feedback
S Guo, B Zhang, T Liu, T Liu, M Khalman, F Llinares, A Rame, T Mesnard, ...
arXiv preprint arXiv:2402.04792, 2024
42024
Gemma: Open models based on gemini research and technology
G Team, T Mesnard, C Hardin, R Dadashi, S Bhupatiraju, S Pathak, ...
arXiv preprint arXiv:2403.08295, 2024
22024
A survey of temporal credit assignment in deep reinforcement learning
E Pignatelli, J Ferret, M Geist, T Mesnard, H van Hasselt, L Toni
arXiv preprint arXiv:2312.01072, 2023
22023
Quantile credit assignment
T Mesnard, W Chen, A Saade, Y Tang, M Rowland, T Weber, C Lyle, ...
International Conference on Machine Learning, 24517-24531, 2023
12023
Activation alignment: exploring the use of approximate activity gradients in multilayer networks
T Mesnard, B Richards
2018 Conference on Cognitive Computational Neuroscience, Brentwood …, 2018
12018
Connectionist Temporal Classification: Labelling Unsegmented Sequences with Recurrent Neural Networks
A AUVOLAT, T MESNARD
12006
The system can't perform the operation now. Try again later.
Articles 1–18