3 21 2

Qingcheng Zeng

qcz

qcznlp

AI & ML interests

None yet

Recent Activity

updated a model 8 minutes ago

qcz/calibration-40

published a model 12 minutes ago

qcz/calibration-40

updated a model about 3 hours ago

qcz/baseline-20

View all activity

Organizations

updated a model 8 minutes ago

qcz/calibration-40

Updated 8 minutes ago

published a model 12 minutes ago

qcz/calibration-40

Updated 8 minutes ago

updated a model about 3 hours ago

qcz/baseline-20

Updated about 3 hours ago

published a model about 3 hours ago

qcz/baseline-20

Updated about 3 hours ago

updated a model about 13 hours ago

qcz/calibration-20

Updated about 13 hours ago

published a model about 13 hours ago

qcz/calibration-20

Updated about 13 hours ago

upvoted a paper about 1 month ago

Global PIQA: Evaluating Physical Commonsense Reasoning Across 100+ Languages and Cultures

Paper • 2510.24081 • Published Oct 28 • 16

upvoted a paper 2 months ago

Good Intentions Beyond ACL: Who Does NLP for Social Good, and Where?

Paper • 2510.04434 • Published Oct 6 • 5

commented a paper 2 months ago

Good Intentions Beyond ACL: Who Does NLP for Social Good, and Where?

Paper • 2510.04434 • Published Oct 6 • 5 •

upvoted 3 papers 2 months ago

DeepSearch: Overcome the Bottleneck of Reinforcement Learning with Verifiable Rewards via Monte Carlo Tree Search

Paper • 2509.25454 • Published Sep 29 • 140

Multiplayer Nash Preference Optimization

Paper • 2509.23102 • Published Sep 27 • 62

EPO: Entropy-regularized Policy Optimization for LLM Agents Reinforcement Learning

Paper • 2509.22576 • Published Sep 26 • 134

upvoted a paper 3 months ago

The Majority is not always right: RL training for solution aggregation

Paper • 2509.06870 • Published Sep 8 • 16

liked a model 3 months ago

google/embeddinggemma-300m

published a dataset 3 months ago

agentorg/webvoyager

Viewer • Updated Sep 3 • 60 • 405

updated a dataset 3 months ago

agentorg/webvoyager

Viewer • Updated Sep 3 • 60 • 405

upvoted 4 papers 4 months ago

On the Generalization of SFT: A Reinforcement Learning Perspective with Reward Rectification

Paper • 2508.05629 • Published Aug 7 • 180

R-Zero: Self-Evolving Reasoning LLM from Zero Data

Paper • 2508.05004 • Published Aug 7 • 130

VeriGUI: Verifiable Long-Chain GUI Dataset

Paper • 2508.04026 • Published Aug 6 • 160

Cognitive Kernel-Pro: A Framework for Deep Research Agents and Agent Foundation Models Training

Paper • 2508.00414 • Published Aug 1 • 93

Qingcheng Zeng

AI & ML interests

Recent Activity

Organizations

qcz's activity