Seuraa
Sören Mindermann
Sören Mindermann
Vahvistettu sähköpostiosoite verkkotunnuksessa cs.ox.ac.uk - Kotisivu
Nimike
Viittaukset
Viittaukset
Vuosi
Inferring the effectiveness of government interventions against COVID-19
J Brauner*, S Mindermann*, M Sharma*, D Johnston, J Salvatier, ...
Science 371 (6531), 2021
10812021
Understanding the effectiveness of government interventions against the resurgence of COVID-19 in Europe
M Sharma*, S Mindermann*, C Rogers-Smith, G Leech, B Snodin, J Ahuja, ...
Nature Communications 12 (1), 1-13, 2021
2272021
The alignment problem from a deep learning perspective
R Ngo, L Chan, S Mindermann
ICLR 2024, 2022
1392022
Occam's razor is insufficient to infer the preferences of irrational agents
S Armstrong*, S Mindermann*
NeurIPS, 2018
128*2018
Changing composition of SARS-CoV-2 lineages and rise of Delta variant in England
S Mishra*, S Mindermann*, M Sharma*, C Whittaker*, TA Mellan, T Wilton, ...
EClinicalMedicine - The Lancet 39, 101064, 2021
125*2021
Prioritized training on points that are learnable, worth learning, and not yet learned
S Mindermann*, M Razzak*, W Xu, A Kirsch, M Sharma, A Morisot, ...
ICML, 2022
972022
Mask wearing in community settings reduces SARS-CoV-2 transmission
G Leech, C Rogers-Smith, JT Monrad, JB Sandbrink, B Snodin, R Zinkov, ...
Proceedings of the National Academy of Sciences 119 (23), e2119266119, 2022
96*2022
Is the cure really worse than the disease? The health impacts of lockdowns during COVID-19
G Meyerowitz-Katz, S Bhatt, O Ratmann, JM Brauner, S Flaxman, ...
BMJ global health 6 (8), e006653, 2021
862021
Identifying Causal-Effect Inference Failure with Uncertainty-Aware Models
A Jesson*, S Mindermann*, U Shalit, Y Gal
NeurIPS, 2020
832020
Quantifying Ignorance in Individual-Level Causal-Effect Estimates under Hidden Confounding
A Jesson, S Mindermann, Y Gal, U Shalit
ICML, 2021
582021
Managing AI risks in an era of rapid progress
Y Bengio, G Hinton, A Yao, D Song, P Abbeel, YN Harari, YQ Zhang, ...
Science 384 (6698), 2023
562023
Active Inverse Reward Design
S Mindermann*, R Shah*, A Gleave, D Hadfield-Menell
arXiv preprint arXiv:1809.03060, 2018
53*2018
Seasonal variation in SARS-CoV-2 transmission in temperate climates: A Bayesian modelling study in 143 European regions
T Gavenčiak, JT Monrad, G Leech, M Sharma, S Mindermann, S Bhatt, ...
PLoS computational biology 18 (8), e1010435, 2022
522022
How Robust are the Estimated Effects of Nonpharmaceutical Interventions against COVID-19?
M Sharma*, S Mindermann*, J Brauner*, G Leech, A Stephenson, ...
NeurIPS (Spotlight talk), 2020
31*2020
Sleeper agents: Training deceptive llms that persist through safety training
E Hubinger, C Denison, J Mu, M Lambert, M Tong, M MacDiarmid, ...
arXiv preprint arXiv:2401.05566, 2024
282024
How to catch an ai liar: Lie detection in black-box llms by asking unrelated questions
L Pacchiardi, AJ Chan, S Mindermann, I Moscovitz, AY Pan, Y Gal, ...
ICLR 2024, 2023
252023
Managing extreme AI risks amid rapid progress
Y Bengio, G Hinton, A Yao, D Song, P Abbeel, T Darrell, YN Harari, ...
Science 384 (6698), 842-845, 2024
232024
Effectiveness assessment of non-pharmaceutical interventions: lessons learned from the COVID-19 pandemic
A Lison, N Banholzer, M Sharma, S Mindermann, HJT Unwin, S Mishra, ...
The Lancet Public Health 8 (4), e311-e317, 2023
212023
Inferring the effectiveness of government interventions against COVID-19. Science, eabd9338
JM Brauner, S Mindermann, M Sharma, D Johnston, J Salvatier, ...
202020
Specific versus general principles for constitutional ai
S Kundu, Y Bai, S Kadavath, A Askell, A Callahan, A Chen, A Goldie, ...
arXiv preprint arXiv:2310.13798, 2023
132023
Järjestelmä ei voi suorittaa toimenpidettä nyt. Yritä myöhemmin uudelleen.
Artikkelit 1–20