Follow
Qingqing Cao
Qingqing Cao
AIML Research Scientist, Apple
Verified email at apple.com - Homepage
Title
Cited by
Cited by
Year
MobiRNN: Efficient recurrent neural network execution on mobile GPU
Q Cao, N Balasubramanian, A Balasubramanian
EMDL@MobiSys 2017, Proceedings of the 1st International Workshop on Deep …, 2017
752017
Efficient methods for natural language processing: A survey
M Treviso, JU Lee, T Ji, B Aken, Q Cao, MR Ciosici, M Hassid, K Heafield, ...
TACL 2023, Transactions of the Association for Computational Linguistics 11 …, 2023
712023
DeFormer: Decomposing Pre-trained Transformers for Faster Question Answering
Q Cao, H Trivedi, A Balasubramanian, N Balasubramanian
ACL 2020, 2020
662020
A survey for efficient open domain question answering
Q Zhang, S Chen, D Xu, Q Cao, X Chen, T Cohn, M Fang
ACL 2023, 2022
282022
Towards Accurate and Reliable Energy Measurement of NLP Models
Q Cao, A Balasubramanian, N Balasubramanian
SustaiNLP@EMNLP 2020, 2020
282020
Uiwear: Easily adapting user interfaces for wearable devices
J Xu*, Q Cao*, A Prakash, A Balasubramanian, DE Porter
MobiCom 2017, Proceedings of the 23rd Annual International Conference on …, 2017
272017
Deqa: On-device question answering
Q Cao, N Weber, N Balasubramanian, A Balasubramanian
MobiSys 2019, Proceedings of the 17th Annual International Conference on …, 2019
172019
IrEne: Interpretable Energy Prediction for Transformers
Q Cao, YK Lal, H Trivedi, A Balasubramanian, N Balasubramanian
ACL 2021, 2021
162021
Pumer: Pruning and merging tokens for efficient vision language models
Q Cao, B Paranjape, H Hajishirzi
ACL 2023, 2023
152023
OpenELM: An Efficient Language Model Family with Open-source Training and Inference Framework
S Mehta, MH Sekhavat, Q Cao, M Horton, Y Jin, C Sun, I Mirzadeh, ...
ES-FoMo@ICML2024, 2024
13*2024
Apt: Adaptive pruning and tuning pretrained language models for efficient training and inference
B Zhao, H Hajishirzi, Q Cao
ICML 2024 (oral), 2024
92024
MobiVQA: Efficient On-Device Visual Question Answering
Q Cao, P Khanna, ND Lane, A Balasubramanian
UbiComp 2022, Proceedings of the ACM on Interactive, Mobile, Wearable and …, 2022
72022
Are mobile dnn accelerators accelerating dnns?
Q Cao, AE Irimiea, M Abdelfattah, A Balasubramanian, ND Lane
EMDL@MobiSys 2021, Proceedings of the 5th International Workshop on Embedded …, 2021
62021
Adanns: A framework for adaptive semantic search
A Rege, A Kusupati, A Fan, Q Cao, S Kakade, P Jain, A Farhadi
NeurIPS 2023, Advances in Neural Information Processing Systems 36, 76311-76335, 2023
32023
Efficiency pentathlon: A standardized arena for efficiency evaluation
H Peng, Q Cao, J Dodge, ME Peters, J Fernandez, T Sherborne, K Lo, ...
arXiv preprint arXiv:2307.09701, 2023
32023
BTR: Binary Token Representations for Efficient Retrieval Augmented Language Models
Q Cao, S Min, Y Wang, H Hajishirzi
ICLR 2024 (spotlight), 2023
12023
IrEne-viz: Visualizing Energy Consumption of Transformer Models
YK Lal, R Singh, H Trivedi, Q Cao, A Balasubramanian, ...
Proceedings of the 2021 Conference on Empirical Methods in Natural Language …, 2021
12021
Bew: Towards Answering Business-Entity-Related Web Questions
Q Cao, O Riva, A Balasubramanian, N Balasubramanian
arXiv preprint arXiv:2012.05818, 2020
12020
Efficient Natural Language Processing for Heterogeneous Platforms
Q Cao
State University of New York at Stony Brook, 2021
2021
The system can't perform the operation now. Try again later.
Articles 1–19