
Wenkai Yang

Keven16

https://keven980716.github.io/

keven980716

AI & ML interests

None yet

Organizations

None yet

Recent Activity

upvoted a paper 6 days ago

Mixture of Horizons in Action Chunking

Paper • 2511.19433 • Published 14 days ago • 17

upvoted a paper about 1 month ago

Visual Backdoor Attacks on MLLM Embodied Decision Making via Contrastive Trigger Learning

Paper • 2510.27623 • Published Oct 31 • 12

commented on a paper about 1 month ago

Critique-RL: Training Language Models for Critiquing through Two-Stage Reinforcement Learning

Paper • 2510.24320 • Published Oct 28 • 18

authored a paper about 2 months ago

LaSeR: Reinforcement Learning with Last-Token Self-Rewarding

Paper • 2510.14943 • Published Oct 16 • 39

upvoted 2 papers about 2 months ago

Stress Testing Generalization: How Minor Modifications Undermine Large Language Model Performance

Paper • 2502.12459 • Published Feb 18 • 2

LaSeR: Reinforcement Learning with Last-Token Self-Rewarding

Paper • 2510.14943 • Published Oct 16 • 39

updated a collection about 2 months ago

LaSeR

Collection

Models from the paper "LaSeR: Reinforcement Learning with Last-Token Self-Rewarding" • 5 items • Updated Oct 17 • 1

commented on a paper about 2 months ago

LaSeR: Reinforcement Learning with Last-Token Self-Rewarding

Paper • 2510.14943 • Published Oct 16 • 39

published a dataset about 2 months ago

Keven16/LaSeR_training_data

Viewer • Updated Oct 16 • 104k • 56 • 2

published 3 models about 2 months ago

updated a dataset about 2 months ago

Keven16/LaSeR_training_data

Viewer • Updated Oct 16 • 104k • 56 • 2

updated a model about 2 months ago

Keven16/Qwen2.5-7B-LaSeR

8B • Updated Oct 15 • 8

upvoted a collection about 2 months ago

AEPO

Collection

The official datasets and model checkpoints of AEPO • 4 items • Updated Oct 21 • 4
