Seuraa
Shuming Ma
Shuming Ma
Microsoft Research Asia
Vahvistettu sähköpostiosoite verkkotunnuksessa microsoft.com - Kotisivu
Nimike
Viittaukset
Viittaukset
Vuosi
SGM: sequence generation model for multi-label classification
P Yang, X Sun, W Li, S Ma, W Wu, H Wang
arXiv preprint arXiv:1806.04822, 2018
3992018
Language is not all you need: Aligning perception with language models
S Huang, L Dong, W Wang, Y Hao, S Singhal, S Ma, T Lv, L Cui, ...
Advances in Neural Information Processing Systems 36, 2024
2212024
Global encoding for abstractive summarization
J Lin, X Sun, S Ma, Q Su
arXiv preprint arXiv:1805.03989, 2018
1812018
Why can gpt learn in-context? language models secretly perform gradient descent as meta optimizers
D Dai, Y Sun, L Dong, Y Hao, Z Sui, F Wei
arXiv preprint arXiv:2212.10559, 2022
169*2022
meprop: Sparsified back propagation for accelerated deep learning with reduced overfitting
X Sun, X Ren, S Ma, H Wang
International Conference on Machine Learning, 3299-3308, 2017
1692017
Kosmos-2: Grounding Multimodal Large Language Models to the World
Z Peng, W Wang, L Dong, Y Hao, S Huang, S Ma, F Wei
arXiv preprint arXiv:2306.14824, 2023
1562023
Graph of thoughts: Solving elaborate problems with large language models
M Besta, N Blach, A Kubicek, R Gerstenberger, L Gianinazzi, J Gajda, ...
arXiv preprint arXiv:2308.09687, 2023
1372023
Deepnet: Scaling transformers to 1,000 layers
H Wang, S Ma, L Dong, S Huang, D Zhang, F Wei
arXiv preprint arXiv:2203.00555, 2022
1042022
Xlm-e: Cross-lingual language model pre-training via electra
Z Chi, S Huang, L Dong, S Ma, B Zheng, S Singhal, P Bajaj, X Song, ...
arXiv preprint arXiv:2106.16138, 2021
952021
A simple and effective unified encoder for document-level machine translation
S Ma, D Zhang, M Zhou
Proceedings of the 58th annual meeting of the association for computational …, 2020
892020
Improving semantic relevance for sequence-to-sequence learning of chinese social media text summarization
S Ma, X Sun, J Xu, H Wang, W Li, Q Su
arXiv preprint arXiv:1706.02459, 2017
772017
Bag-of-words as target for neural machine translation
S Ma, X Sun, Y Wang, J Lin
arXiv preprint arXiv:1805.04871, 2018
722018
Query and output: Generating words by querying distributed word representations for paraphrase generation
S Ma, X Sun, W Li, S Li, W Li, X Ren
arXiv preprint arXiv:1803.01465, 2018
712018
Language models are general-purpose interfaces
Y Hao, H Song, L Dong, S Huang, Z Chi, W Wang, S Ma, F Wei
arXiv preprint arXiv:2206.06336, 2022
672022
Semantic-unit-based dilated convolution for multi-label text classification
J Lin, Q Su, P Yang, S Ma, X Sun
arXiv preprint arXiv:1808.08561, 2018
652018
Alternating language modeling for cross-lingual pre-training
J Yang, S Ma, D Zhang, S Wu, Z Li, M Zhou
Proceedings of the AAAI Conference on Artificial Intelligence 34 (05), 9386-9393, 2020
642020
mT6: Multilingual pretrained text-to-text transformer with translation pairs
Z Chi, L Dong, S Ma, SHXL Mao, H Huang, F Wei
arXiv preprint arXiv:2104.08692, 2021
612021
Deltalm: Encoder-decoder pre-training for language generation and translation by augmenting pretrained multilingual encoders
S Ma, L Dong, S Huang, D Zhang, A Muzio, S Singhal, HH Awadalla, ...
arXiv preprint arXiv:2106.13736, 2021
582021
A deep reinforced sequence-to-set model for multi-label classification
P Yang, F Luo, S Ma, J Lin, X Sun
Proceedings of the 57th Annual Meeting of the Association for Computational …, 2019
582019
A hierarchical end-to-end model for jointly improving text summarization and sentiment classification
S Ma, X Sun, J Lin, X Ren
arXiv preprint arXiv:1805.01089, 2018
572018
Järjestelmä ei voi suorittaa toimenpidettä nyt. Yritä myöhemmin uudelleen.
Artikkelit 1–20