Devstral 2 Collection A couple of agentic LLMs for software engineering tasks, excelling at using tools to explore codebases, edit multiple files, and power SWE Agents. • 3 items • Updated about 4 hours ago • 14
On the Workflows and Smells of Leaderboard Operations (LBOps): An Exploratory Study of Foundation Model Leaderboards Paper • 2407.04065 • Published Jul 4, 2024 • 1
GLM-4.1V-Thinking: Towards Versatile Multimodal Reasoning with Scalable Reinforcement Learning Paper • 2507.01006 • Published Jul 1 • 242
SignRoundV2: Closing the Performance Gap in Extremely Low-Bit Post-Training Quantization for LLMs Paper • 2512.04746 • Published 5 days ago • 11
Mitigating Object and Action Hallucinations in Multimodal LLMs via Self-Augmented Contrastive Alignment Paper • 2512.04356 • Published 6 days ago • 9