Zhang Zhiyuan
Understanding and Improving Layer Normalization
J Xu, X Sun, Z Zhang, G Zhao, J Lin
Advances in Neural Information Processing Systems, 4381-4391, 2019
Explicit Sparse Transformer: Concentrated Attention Through Explicit Selection
G Zhao, J Lin, Z Zhang, X Ren, Q Su, X Sun
arXiv preprint arXiv:1912.11637, 2019
Be careful about poisoned word embeddings: Exploring the vulnerability of the embedding layers in NLP models
W Yang, L Li, Z Zhang, X Ren, X Sun, B He
arXiv preprint arXiv:2103.15543, 2021
MUSE: Parallel Multi-Scale Attention for Sequence to Sequence Learning
G Zhao, X Sun, J Xu, Z Zhang, L Luo
arXiv preprint arXiv:1911.09483, 2019
Rethinking Skip Connection with Layer Normalization
F Liu, X Ren, Z Zhang, X Sun, Y Zou
Proceedings of the 28th International Conference on Computational …, 2020
Raise a Child in Large Language Model: Towards Effective and Generalizable Fine-tuning
R Xu, F Luo, Z Zhang, C Tan, B Chang, S Huang, F Huang
arXiv preprint arXiv:2109.05687, 2021
Exploring the vulnerability of deep neural networks: A study of parameter corruption
X Sun, Z Zhang, X Ren, R Luo, L Li
arXiv preprint arXiv:2006.05620, 2020
Automatic Translating Between Ancient Chinese and Contemporary Chinese with Limited Aligned Corpora
Z Zhang, W Li, Q Su
CCF International Conference on Natural Language Processing and Chinese …, 2019
Pretrain-KGE: learning knowledge representation from pretrained language models
Z Zhang, X Liu, Y Zhang, Q Su, X Sun, B He
Findings of the Association for Computational Linguistics: EMNLP 2020, 259-266, 2020
Memorized sparse backpropagation
Z Zhang, P Yang, X Ren, Q Su, X Sun
Neurocomputing 415, 397-407, 2020
Neural Network Surgery: Injecting Data Patterns into Pre-trained Models with Minimal Instance-wise Side Effects
Z Zhang, X Ren, Q Su, X Sun, B He
Proceedings of the 2021 Conference of the North American Chapter of the …, 2021
Building an ellipsis-aware Chinese dependency treebank for web text
X Ren, X Sun, J Wen, B Wei, W Zhan, Z Zhang
arXiv preprint arXiv:1801.06613, 2018
How to Inject Backdoors with Better Consistency: Logit Anchoring on Clean Data
Z Zhang, L Lyu, W Wang, L Sun, X Sun
arXiv preprint arXiv:2109.01300, 2021
Adversarial parameter defense by multi-step risk minimization
Z Zhang, R Luo, X Ren, Q Su, L Li, X Sun
Neural Networks 144, 154-163, 2021
ASAT: Adaptively Scaled Adversarial Training in Time Series
Z Zhang, W Li, R Bao, K Harimoto, Y Wu, X Sun
arXiv preprint arXiv:2108.08976, 2021
Learning Robust Representation for Clustering through Locality Preserving Variational Discriminative Network
R Luo, W Li, Z Zhang, R Bao, K Harimoto, X Sun
arXiv preprint arXiv:2012.13489, 2020
Primal Meaning Recommendation via On-line Encyclopedia
Z Zhang, W Li, J Xu, X Sun
arXiv preprint arXiv:1808.04660, 2018