On the variance of the adaptive learning rate and beyond L Liu, H Jiang, P He, W Chen, X Liu, J Gao, J Han arXiv preprint arXiv:1908.03265, 2019 | 951 | 2019 |
Multi-task deep neural networks for natural language understanding X Liu, P He, W Chen, J Gao arXiv preprint arXiv:1901.11504, 2019 | 815 | 2019 |
Deberta: Decoding-enhanced bert with disentangled attention P He, X Liu, J Gao, W Chen arXiv preprint arXiv:2006.03654, 2020 | 347 | 2020 |
Reasonet: Learning to stop reading in machine comprehension Y Shen, PS Huang, J Gao, W Chen Proceedings of the 23rd ACM SIGKDD International Conference on Knowledge …, 2017 | 282 | 2017 |
Short text conceptualization using a probabilistic knowledgebase Y Song, H Wang, Z Wang, H Li, W Chen Twenty-second international joint conference on artificial intelligence, 2011 | 259 | 2011 |
Fusionnet: Fusing via fully-aware attention with application to machine comprehension HY Huang, C Zhu, Y Shen, W Chen arXiv preprint arXiv:1711.07341, 2017 | 170 | 2017 |
Document transformation for multi-label feature selection in text categorization W Chen, J Yan, B Zhang, Z Chen, Q Yang Seventh IEEE International Conference on Data Mining (ICDM 2007), 451-456, 2007 | 170 | 2007 |
Smart: Robust and efficient fine-tuning for pre-trained natural language models through principled regularized optimization H Jiang, P He, W Chen, X Liu, J Gao, T Zhao arXiv preprint arXiv:1911.03437, 2019 | 143 | 2019 |
Improving multi-task deep neural networks via knowledge distillation for natural language understanding X Liu, P He, W Chen, J Gao arXiv preprint arXiv:1904.09482, 2019 | 122 | 2019 |
A novel click model and its applications to online advertising ZA Zhu, W Chen, T Minka, C Zhu, Z Chen Proceedings of the third ACM international conference on Web search and data …, 2010 | 118 | 2010 |
User-click modeling for understanding and predicting search-behavior Y Zhang, W Chen, D Wang, Q Yang Proceedings of the 17th ACM SIGKDD international conference on Knowledge …, 2011 | 115 | 2011 |
P-packSVM: Parallel primal gradient descent kernel SVM AZ Zeyuan, C Weizhu, W Gang, Z Chenguang, C Zheng 2009 Ninth IEEE International Conference on Data Mining, 677-686, 2009 | 93 | 2009 |
Characterizing search intent diversity into click models B Hu, Y Zhang, W Chen, G Wang, Q Yang Proceedings of the 20th international conference on World wide web, 17-26, 2011 | 83 | 2011 |
Understanding the difficulty of training transformers L Liu, X Liu, J Gao, W Chen, J Han arXiv preprint arXiv:2004.08249, 2020 | 79 | 2020 |
Personalized click model through collaborative filtering S Shen, B Hu, W Chen, Q Yang Proceedings of the fifth ACM international conference on Web search and data …, 2012 | 77 | 2012 |
Internet visualization system and related user interfaces M Wang, W Chen, B Zhang, Z Chen, J Wang US Patent 7,873,904, 2011 | 75 | 2011 |
Beyond ten blue links: enabling user click modeling in federated web search D Chen, W Chen, H Wang, Z Chen, Q Yang Proceedings of the fifth ACM international conference on Web search and data …, 2012 | 69 | 2012 |
Adversarial training for large neural language models X Liu, H Cheng, P He, W Chen, Y Wang, H Poon, J Gao arXiv preprint arXiv:2004.08994, 2020 | 67 | 2020 |
Large-scale L-BFGS using MapReduce W Chen, Z Wang, J Zhou Advances in neural information processing systems 27, 2014 | 67 | 2014 |
Method and apparatus for establishing relationship between documents QB Wang, WZ Chen, B Fei, Z Su US Patent 7,809,716, 2010 | 67 | 2010 |