CRITIC: Large Language Models Can Self-Correct with Tool-Interactive Critiquing Z Gou, Z Shao, Y Gong, Y Shen, Y Yang, N Duan, W Chen ICLR 2024, 2023 | 242* | 2023 |
ToRA: A Tool-Integrated Reasoning Agent for Mathematical Problem Solving Z Gou, Z Shao, Y Gong, Y Yang, M Huang, N Duan, W Chen ICLR 2024, 2023 | 124 | 2023 |
Long Time No See! Open-Domain Conversation with Long-Term Persona Memory Z Gou*, X Xu*, W Wu, ZY Niu, H Wu, H Wang, S Wang ACL 2022 Findings, 2639–2650, 2022 | 96 | 2022 |
DeepSeek-Coder-V2: Breaking the Barrier of Closed-Source Models in Code Intelligence Q Zhu, D Guo, Z Shao, D Yang, P Wang, R Xu, Y Wu, Y Li, H Gao, S Ma, ... arXiv preprint arXiv:2406.11931, 2024 | 78* | 2024 |
MvP: Multi-view Prompting Improves Aspect Sentiment Tuple Prediction Z Gou*, Q Guo*, Y Yang ACL 2023, 2023 | 49 | 2023 |
Data Interpreter: An LLM Agent for Data Science arXiv preprint arXiv:2402.18679, 2024 | 37 | 2024 |
Rho-1: Not All Tokens Are What You Need Z Lin*, Z Gou*, Y Gong, X Liu, Y Shen, R Xu, C Lin, Y Yang, J Jiao, ... NeurIPS 2024 (Oral), 2024 | 32 | 2024 |
Key-Point-Driven Data Synthesis with Its Enhancement on Mathematical Reasoning Y Huang, X Liu, Y Gong, Z Gou, Y Shen, N Duan, W Chen AAAI 2025, 2024 | 16 | 2024 |
CriticBench: Benchmarking LLMs for Critique-Correct Reasoning Z Lin*, Z Gou*, T Liang, R Luo, H Liu, Y Yang ACL 2024 Findings, 2024 | 15 | 2024 |
DeepSeek-Prover-V1.5: Harnessing Proof Assistant Feedback for Reinforcement Learning and Monte-Carlo Tree Search H Xin, ZZ Ren, J Song, Z Shao, W Zhao, H Wang, B Liu, L Zhang, X Lu, ... arXiv preprint arXiv:2408.08152, 2024 | 11 | 2024 |
SciAgent: Tool-augmented Language Models for Scientific Reasoning Y Ma, Z Gou, J Hao, R Xu, S Wang, L Pan, Y Yang, Y Cao, A Sun EMNLP 2024, 2024 | 9 | 2024 |
Exploring the Mystery of Influential Data for Mathematical Reasoning X Ni, Y Gong, Z Gou, Y Shen, Y Yang, N Duan, W Chen COLM 2024, 2024 | 2 | 2024 |