Laura Weidinger
Staff Research Scientist at DeepMind
Verified email at google.com
Title
Cited by
Year
Scaling language models: Methods, analysis & insights from training Gopher
JW Rae, S Borgeaud, T Cai, K Millican, J Hoffmann, F Song, J Aslanides, ...
arXiv preprint arXiv:2112.11446, 2021
Cited by: 859; Year: 2021
Ethical and social risks of harm from language models
L Weidinger, J Mellor, M Rauh, C Griffin, J Uesato, PS Huang, M Cheng, ...
arXiv preprint arXiv:2112.04359, 2021
Cited by: 706; Year: 2021
Taxonomy of risks posed by language models
L Weidinger, J Uesato, M Rauh, C Griffin, PS Huang, J Mellor, A Glaese, ...
Proceedings of the 2022 ACM Conference on Fairness, Accountability, and …, 2022
Cited by: 399; Year: 2022
Improving alignment of dialogue agents via targeted human judgements
A Glaese, N McAleese, M Trębacz, J Aslanides, V Firoiu, T Ewalds, ...
arXiv preprint arXiv:2209.14375, 2022
Cited by: 354; Year: 2022
Alignment of language agents
Z Kenton, T Everitt, L Weidinger, I Gabriel, V Mikulik, G Irving
arXiv preprint arXiv:2103.14659, 2021
Cited by: 133; Year: 2021
Sociotechnical safety evaluation of generative AI systems
L Weidinger, M Rauh, N Marchal, A Manzini, LA Hendricks, ...
arXiv preprint arXiv:2310.11986, 2023
Cited by: 61; Year: 2023
Ethical and social risks of harm from language models. arXiv
L Weidinger, J Mellor, M Rauh, C Griffin, J Uesato, PS Huang, M Cheng, ...
arXiv preprint arXiv:2112.04359 10, 2021
Cited by: 44; Year: 2021
Characteristics of harmful text: Towards rigorous benchmarking of language models
M Rauh, J Mellor, J Uesato, PS Huang, J Welbl, L Weidinger, S Dathathri, ...
Advances in Neural Information Processing Systems 35, 24720-24739, 2022
Cited by: 37; Year: 2022
Spurious normativity enhances learning of compliance and enforcement behavior in artificial agents
R Köster, D Hadfield-Menell, R Everett, L Weidinger, GK Hadfield, ...
Proceedings of the National Academy of Sciences 119 (3), e2106028118, 2022
Cited by: 36; Year: 2022
Social conformity in autism
SC Lazzaro, L Weidinger, RA Cooper, S Baron-Cohen, C Moutsiana, ...
Journal of Autism and Developmental Disorders 49, 1304-1315, 2019
Cited by: 26; Year: 2019
Using the Veil of Ignorance to align AI systems with principles of justice
L Weidinger, KR McKee, R Everett, S Huang, TO Zhu, MJ Chadwick, ...
Proceedings of the National Academy of Sciences 120 (18), e2213709120, 2023
Cited by: 25; Year: 2023
Model-free conventions in multi-agent reinforcement learning with heterogeneous preferences
R Köster, KR McKee, R Everett, L Weidinger, WS Isaac, E Hughes, ...
arXiv preprint arXiv:2010.09054, 2020
Cited by: 24; Year: 2020
Accounting for offensive speech as a practice of resistance
M Díaz, R Amironesei, L Weidinger, I Gabriel
Proceedings of the Sixth Workshop on Online Abuse and Harms (WOAH), 192-202, 2022
Cited by: 15; Year: 2022
Scaling language models: Methods, analysis & insights from training Gopher. arXiv 2021
JW Rae, S Borgeaud, T Cai, K Millican, J Hoffmann, F Song, J Aslanides, ...
arXiv preprint arXiv:2112.11446, 2021
Cited by: 13; Year: 2021
The ethics of advanced AI assistants
I Gabriel, A Manzini, G Keeling, LA Hendricks, V Rieser, H Iqbal, ...
arXiv preprint arXiv:2404.16244, 2024
Cited by: 10; Year: 2024
Test-retest reliability of canonical reinforcement learning models
L Weidinger, A Gradassi, L Molleman, W van den Bos
Cited by: 10*
Improving alignment of dialogue agents via targeted human judgements, 2022
A Glaese, N McAleese, M Trebacz, J Aslanides, V Firoiu, T Ewalds, ...
URL https://storage.googleapis.com/deepmind-media/DeepMind.com/Authors …, 2022
Cited by: 9; Year: 2022
Test–retest reliability of reinforcement learning parameters
JV Schaaf, L Weidinger, L Molleman, W van den Bos
Behavior Research Methods, 1-18, 2023
Cited by: 8; Year: 2023
Language modelling at scale: Gopher, ethical considerations, and retrieval
J Rae, G Irving, L Weidinger
DeepMind Blog, 2021
Cited by: 8; Year: 2021
Artificial moral cognition: Learning from developmental psychology
L Weidinger, M Reinecke, J Haas
Cited by: 4; Year: 2022
Articles 1–20