Victor Sanh

Cited by

	All	Since 2019
Citations	24068	23998
h-index	17	17
i10-index	17	17

10000

5000

2500

7500

201920202021202220232024169 2047 3978 6083 9016 2692

Public access

View all

3 articles

0 articles

available

not available

Based on funding mandates

Co-authors

Thomas WolfCo-founder at HuggingFaceVerified email at polytechnique.edu
Alexander M. RushAssociate Professor, Cornell UniversityVerified email at cornell.edu
Julien ChaumondHugging FaceVerified email at huggingface.co
Lysandre DebutMachine Learning Engineer, Hugging FaceVerified email at huggingface.co
Quentin LhoestHugging FaceVerified email at huggingface.co
Clément DelangueHugging FaceVerified email at huggingface.co
Canwen XuBoson AIVerified email at ucsd.edu
Yacine JerniteResearch Scientist, HuggingFaceVerified email at cs.nyu.edu
Joe DavisonUniversity of UtahVerified email at utah.edu
Julien PluResearch Scientist, LettriaVerified email at eurecom.fr
Rémi Louf🤗 Hugging Face Inc.Verified email at huggingface.co
Morgan FuntowiczHugging FaceVerified email at huggingface.co
Sam ShleiferFacebook AI ResearchVerified email at fb.com
Teven Le ScaoHugging FaceVerified email at huggingface.co
Lewis TunstallHugging FaceVerified email at itp.unibe.ch
Albert Villanova del MoralHugging FaceVerified email at huggingface.co
Sebastian RuderResearch Scientist, CohereVerified email at cohere.com
Thierry TambeResearch Scientist, NVIDIAVerified email at nvidia.com
François LagunasHugging FaceVerified email at huggingface.co
Yonatan BelinkovTechnionVerified email at technion.ac.il

Victor Sanh

Hugging Face

Verified email at huggingface.co

Natural Language Processing Machine Learning Deep Learning


Title Sort by citations Sort by year Sort by title	Cited by Cited by	Year
Transformers: State-of-the-art natural language processing T Wolf, L Debut, V Sanh, J Chaumond, C Delangue, A Moi, P Cistac, ... Proceedings of the 2020 conference on empirical methods in natural language …, 2020	12976*	2020
DistilBERT, a distilled version of BERT: smaller, faster, cheaper and lighter V Sanh, L Debut, J Chaumond, T Wolf arXiv preprint arXiv:1910.01108, 2019	6345*	2019
Multitask Prompted Training Enables Zero-Shot Task Generalization V Sanh, A Webson, C Raffel, SH Bach, L Sutawika, Z Alyafeai, A Chaffin, ... arXiv preprint arXiv:2110.08207, 2021	1210	2021
BLOOM: A 176B-Parameter Open-Access Multilingual Language Model TL Scao, A Fan, C Akiki, E Pavlick, S Ilić, D Hesslow, R Castagné, ... arXiv preprint arXiv:2211.05100, 2022	1134	2022
TransferTransfo: A Transfer Learning Approach for Neural Network Based Conversational Agents T Wolf, V Sanh, J Chaumond, C Delangue arXiv preprint arXiv:1901.08149, 2019	503	2019
Datasets: A Community Library for Natural Language Processing Q Lhoest, AV del Moral, Y Jernite, A Thakur, P von Platen, S Patil, ... arXiv preprint arXiv:2109.02846, 2021	399*	2021
Movement pruning: Adaptive sparsity by fine-tuning V Sanh, T Wolf, A Rush Advances in Neural Information Processing Systems 33, 20378-20389, 2020	367	2020
A hierarchical multi-task approach for learning embeddings from semantic tasks V Sanh, T Wolf, S Ruder Proceedings of the AAAI Conference on Artificial Intelligence 33, 6949-6956, 2019	261	2019
PromptSource: An Integrated Development Environment and Repository for Natural Language Prompts SH Bach, V Sanh, ZX Yong, A Webson, C Raffel, NV Nayak, A Sharma, ... arXiv preprint arXiv:2202.01279, 2022	238	2022
Block Pruning For Faster Transformers F Lagunas, E Charlaix, V Sanh, AM Rush arXiv preprint arXiv:2109.04838, 2021	155	2021
Interactive and Visual Prompt Engineering for Ad-hoc Task Adaptation with Large Language Models H Strobelt, A Webson, V Sanh, B Hoover, J Beyer, H Pfister, AM Rush IEEE transactions on visualization and computer graphics, 2022	104	2022
Edgebert: Sentence-level energy optimizations for latency-aware multi-task nlp inference T Tambe, C Hooper, L Pentecost, T Jia, EY Yang, M Donato, V Sanh, ... MICRO-54: 54th Annual IEEE/ACM International Symposium on Microarchitecture …, 2021	91*	2021
Learning from others' mistakes: Avoiding dataset biases without modeling them V Sanh, T Wolf, Y Belinkov, AM Rush arXiv preprint arXiv:2012.01300, 2020	86	2020
What Language Model to Train if You Have One Million GPU Hours? T Le Scao, T Wang, D Hesslow, L Saulnier, S Bekman, MS Bari, ... Challenges {\&, 2022	78	2022
OBELISC: An Open Web-Scale Filtered Dataset of Interleaved Image-Text Documents H Laurençon, L Saulnier, L Tronchon, S Bekman, A Singh, A Lozhkov, ... arXiv preprint arXiv:2306.16527, 2023	61	2023
Low-Complexity Probing via Finding Subnetworks S Cao, V Sanh, AM Rush arXiv preprint arXiv:2104.03514, 2021	31	2021
Avoiding Inference Heuristics in Few-shot Prompt-based Finetuning PA Utama, NS Moosavi, V Sanh, I Gurevych arXiv preprint arXiv:2109.04144, 2021	29	2021

The system can't perform the operation now. Try again later.

Articles 1–17

Citations per year

Duplicate citations

Merged citations

Add co-authorsCo-authors

Follow

Cited by

Co-authors