absorb.md

Cognition Labs (Devin)

Chronological feed of everything captured from Cognition Labs (Devin).

Windsurf Shifts Coding Benchmarks to Real-World Comparative Testing via Arena Mode

Windsurf has launched Arena Mode, a comparative evaluation framework that allows developers to test two LLMs against a single prompt within their own specific codebase. This approach shifts the benchmark from static datasets to real-world production environments to account for variance in codebase and stack compatibility.

Cognition Labs Transitions X Handle

Cognition Labs has officially transitioned its X (formerly Twitter) handle from an unspecified previous account to @cognition. This change centralizes their social media presence under a more direct and brand-aligned identifier. The move indicates a potential brand consolidation or simplification effort.