Chronological feed of everything captured from Alexander Embiricos.
tweet / embirico / Mar 16
Codex now supports subagents, enabling users to delegate tasks to specialized AI instances. This feature, particularly effective with Spark for Pro users, allows for parallel task execution and a cleaner main context window. The introduction of subagents aims to accelerate workflows by improving task management and agent steering during complex operations, as highlighted by OpenAI Developers.
codexsubagentsai-agentsworkflow-automationopenai-devsdeveloper-tools
“Codex now allows users to employ subagents.”
tweet / embirico / Mar 7
Developer workflows are shifting toward application-based interfaces over traditional CLIs due to increased execution speed and reduced context-switching overhead. This trend is specifically exemplified by the deployment of the OpenAI Codex app, which optimizes the coding environment by minimizing the need for multiple open windows.
openai-codexidedeveloper-toolsapp-developmentcli-vs-gui
“Application-based interfaces have surpassed command-line interfaces (CLI) in preference or utility for certain workflows.”
tweet / embirico / Mar 6
OpenAI's Codex now offers a "fast mode" for its GPT-5.4 model, allowing users to prioritize execution speed over computational cost. This feature provides a 1.5x increase in speed at a 2x cost, enabling developers to optimize workflows based on their specific needs for rapid iteration and debugging versus budget constraints.
llm-inferencemodel-optimizationdeveloper-toolscost-managementai-tradeoffsopenai-codex
“OpenAI's Codex offers a 'fast mode' for its GPT-5.4 model.”
tweet / embirico / Mar 5
OpenAI's new GPT-5.4 model offers significant advancements, particularly in native computer interaction and agentic coding. It boasts an expanded context window of up to 1 million tokens in Codex and the API, enabling more complex and tool-heavy workflows. These improvements collectively position GPT-5.4 as a more capable model for professional and sophisticated coding tasks.
gpt-5.4openaillm-agentscoding-aiai-models
“GPT-5.4 possesses native computer-use capabilities.”
tweet / embirico / Feb 17
OpenAI has implemented several improvements to their request routing system for GPT-5.3-Codex, addressing issues that led to requests being downgraded to GPT-5.2. These changes primarily focus on refining risk classification policies, enhancing user notifications, and ensuring proper access restoration for verified users. The goal is to significantly reduce the incidence of user flagging and improve transparency regarding model downgrades.
gpt-5.3-codexgpt-5.2risk-managementcontent-moderationpolicy-calibrationplatform-stabilityopenai-update
“The number of requests and users flagged as elevated risk has been significantly reduced due to classifier and policy calibration.”
tweet / embirico / Feb 12
OpenAI has released GPT-5.3-Codex-Spark, a new ultra-fast model designed for real-time coding, available initially to ChatGPT Pro users as a research preview. This model leverages a partnership with Cerebras and features optimized infrastructure for low-latency performance. It is currently text-only with a 128k context window, with future plans for expanded capabilities and broader access.
gpt-5.3-codex-sparkai-modelsllm-inferencelow-latencydeveloper-toolscerebras-partnership
“OpenAI has launched GPT-5.3-Codex-Spark, an ultra-fast model for real-time coding.”
tweet / embirico / Feb 11
A social platform experienced a technical issue causing overflagging of suspicious activity for approximately 9% of its user base. The problem, active for a three-hour period, has been resolved, and the platform is implementing measures to prevent recurrence. This incident highlights the challenges in maintaining accurate real-time content moderation at scale.
bug-fixuser-impactplatform-reliabilityincident-responsesystem-monitoring
“A bug caused overflagging of potentially suspicious activity on a social platform.”
tweet / embirico / Feb 11
OpenAI implements a system where API requests routed to its 5.3-Codex model are automatically downgraded to GPT-5.2 for accounts identified as "flagged." This measure is temporary, with specific thresholds and durations for the downgrade actively being adjusted. Users with "Trusted Access for Cyber" are exempt from this policy.
gpt-5.3-codexapi-accesscontent-moderationtrusted-accessai-policy
“Flagged accounts have their 5.3-Codex API requests downgraded to GPT-5.2.”
tweet / embirico / Feb 11
OpenAI is rerouting requests from GPT-5.3-Codex to GPT-5.2 for users whose activity is flagged as potential cyber misuse. This measure is intended to reduce cyber abuse risk. The company plans to implement UI notifications for rerouted users and offers an appeal process for those incorrectly classified, while continuously refining its detection systems.
gpt-codexcontent-moderationcyber-abuse-preventionmodel-routingopenai-policiesuser-experience
“OpenAI routes requests from GPT-5.3-Codex to GPT-5.2 when elevated cyber misuse risk is detected.”
tweet / embirico / Feb 5
OpenAI has released GPT-5.3-Codex, demonstrating state-of-the-art performance on SWEBench Pro and TBench benchmarks. This version introduces significant user experience improvements, including mid-turn updates and non-interruptive steering, alongside a 25% inference speed increase.
gpt-5.3-codexopenaillm-updatescode-generationai-performanceswebench-protbench
“GPT-5.3-Codex achieves state-of-the-art performance on SWEBench Pro and TBench.”
tweet / embirico / Feb 5
OpenAI has released GPT-5.3-Codex, a new model demonstrating state-of-the-art performance on SWEBench Pro and TBench benchmarks. This iteration focuses on improving the user experience through real-time updates and non-interruptive steering. Additionally, the model achieves a 25% speed increase due to inference optimizations.
gpt-5.3-codexllm-updatesai-modelsopenaiinference-improvementsdeveloper-tools
“GPT-5.3-Codex is the new state-of-the-art model for code generation benchmarks.”
tweet / embirico / Jan 30
GPT-5.2 models exhibit significantly improved performance in extended autonomous tasks compared to previous iterations. Key advancements include enhanced instruction adherence, sustained focus, and reduced task drift. This directly translates to more precise and complete task implementations, indicating a substantial leap in agentic AI capabilities.
gpt-5.2autonomous-agentsllm-evaluationai-capabilitiescursor-aiagentic-ai
“GPT-5.2 models are superior in extended autonomous work.”
tweet / embirico / Jan 14
GPT-5.2-Codex has been released in the API, offering enhanced capabilities for complex coding tasks. This model excels at long-running operations such as feature development, code refactoring, and bug detection. Additionally, it possesses advanced cyber capabilities, making it proficient in identifying and understanding codebase vulnerabilities.
llm-updatescoding-assistantsopenaideveloper-toolssoftware-engineering
“GPT-5.2-Codex is now accessible via API.”
tweet / embirico / Jan 6
OpenAI is evaluating the continued use of Codex-based models, specifically older versions like gpt-5 and gpt-5.1, and their Codex variants. This suggests a potential deprecation of these models.
llm-deprecationscodexgpt-modelsmodel-developmentapi-management
“OpenAI is considering deprecating certain Codex-based models.”
youtube / embirico / Dec 14
OpenAI's Codex is disrupting software engineering by augmenting human capabilities rather than replacing them. Its rapid adoption and evolution, particularly with the release of GPT-5.1 Codex Max, demonstrate a significant acceleration in development cycles and problem-solving, even enabling internal projects like the Sora Android app to ship at unprecedented speeds. The core philosophy centers on building proactive AI "teammates" that integrate seamlessly into developer workflows, with a long-term vision of empowering AI agents to interact with computers by writing code, transcending traditional human limitations in speed and multitasking. This bottom-up, empirical approach to product development, coupled with a focus on human empowerment, positions Codex as a pivotal tool for future software creation and broad AI integration.
ai-agentscoding-assistantdeveloper-toolsproduct-managementstartup-growthfuture-of-workai-accelerating-development
“OpenAI’s Codex has achieved explosive growth, with a 20x increase in usage since August.”
youtube / embirico / Dec 5
OpenAI's Codex Max is significantly enhancing developer productivity through its advanced coding agent capabilities. This includes features like the new GPT 5.1 Codex Max model, which offers improved coding intelligence and efficiency, and tools such as agent-driven code review and task management integration. These advancements allow engineers to offload routine tasks, collaborate more effectively with AI, and accelerate development cycles, as evidenced by the rapid development of the Sora Android app.
ai-coding-agentsopenai-codexide-extensionsdeveloper-productivityllm-applicationssoftware-developmentai-workflows
“Codex Max, powered by GPT 5.1, demonstrates superior coding intelligence and token efficiency compared to previous models.”
youtube / embirico / May 16 / failed
youtube / embirico / May 16
OpenAI's Codex (built on the "Codex-1" model) is architected as a fully cloud-hosted autonomous coding agent — not merely a hosted CLI — with deliberate design choices around independent long-running task execution (up to 60 minutes), sandboxed compute environments, and parallelism (up to 60 concurrent tasks/hour). The core product philosophy pushes complexity into the model via training rather than deterministic scaffolding, with the explicit goal of avoiding brittle state machines in favor of emergent model behaviors. Key developer best practices identified include maintaining an agents.md instruction file, enforcing linting/formatting for in-loop verification, writing modular/testable code, and adopting an "abundance mindset" — firing off many parallel tasks with minimal prompt crafting rather than treating it like an interactive IDE. The team frames Codex-1 as a "college grad with a few years of job experience starting their first day at your company," where agents.md compresses the onboarding surface.
ai-coding-agentsdevtoolsopenaiagentic-aillm-infrastructuredeveloper-experiencesoftware-engineering-ai
“Codex tasks have a hard cut-off of one hour, with an average task completion time significantly below 30 minutes, and a concurrency limit of 60 tasks per hour per user during the research preview.”