Chronological feed of everything captured from swyx.
youtube / swyx / 17h ago
Recorded live from AI Engineer Europe in London, this episode covers the week's major AI developments: Anthropic's preview release of Claude Mythos (restricted to ~40 enterprise partners under Project Glass Wing due to unprecedented cybersecurity capabilities), the rapid growth of agentic coding tools like Open Claude and Codex, and the emerging consensus around "harness engineering" and context engineering as durable pillars of agent-based AI. Swyx and other practitioners argue that extreme parallelism in multi-agent systems is placing unprecedented load on infrastructure like GitHub, while security concerns around supply-chain attacks (LiteLLM, Axios, Open Claude) are pushing developers toward zero-dependency, self-owned codebases. The episode also highlights that Anthropic's decision not to release Mythos publicly may be as much about compute constraints ($30B ARR but limited GPU capacity) as safety posture.
ai-engineeringllm-infrastructureai-securityai-agentsopen-source-aimcp-protocol
“Claude Mythos achieved 77% on SWE-bench Pro, up from ~53% for Opus 4.6 — a ~24 percentage point jump representing a massive capability leap in autonomous coding and security research.”
youtube / swyx / 1d ago
Anthropic's recent Mythos model, despite not being publicly released due to perceived security risks, showcases significant advancements in AI, particularly in coding capabilities with a 77% score on Swebench Pro. The model is accessible to select enterprise partners. This strategic approach, alongside a 10% Google investment and notable valuation increase to $30B ARR, positions Anthropic as a key player in the AI landscape, albeit with ongoing debates about accessibility and the balance between innovation and responsible deployment. The industry is also grappling with the implications of AI agents and the future of open-source tools like OpenCL and their potential use in autonomous security research.
ai-engineer-summitllm-securityai-agentsopen-source-aiai-conferencesmodel-benchmarkingdeveloper-tools
“Anthropic's Mythos model achieves a 77% score on Swebench Pro, indicating superior coding capabilities.”
youtube / swyx / 1d ago / failed
youtube / swyx / 1d ago
Anthropic's new Mythos model, while not publicly released due to perceived danger, is being deployed to major companies for cybersecurity testing, demonstrating advanced capabilities in vulnerability detection. This move highlights a growing trend of AI models being restricted to select entities, sparking debate about accessibility and the potential for a "permanent underclass" in AI development. The model's high performance in coding benchmarks and autonomous vulnerability discovery raises both excitement and concern within the AI community.
ai-engineer-summitllm-securityopen-source-modelsai-agentsconference-recapdeveloper-toolsai-community
“Anthropic's Mythos model achieves significant advancements in coding benchmarks, surpassing previous models like Opus 4.6 by over 10% in Sweepbench Pro.”
youtube / swyx / 2d ago
Traditional metrics like job creation inadequately assess startup ecosystem success; instead, "optionality" for workforce mobility and skill transference within a focused sector is a more relevant indicator. Effective ecosystem development requires intentional design beyond basic meetups and accelerators, encompassing diverse funding models, policy considerations, and university collaborations to cultivate environments where startups, not just general entrepreneurship, can thrive. South by Southwest (SXSW) serves as a unique "soft landing" program for states and regions, fostering collisions between culture, innovation, and venture capital, thereby enabling strategic multi-state collaborations, particularly in shared economic focus areas like aerospace, energy, clean tech, and AI data centers.
south-by-southweststartup-ecosystemsentrepreneurshipregional-developmentnetworking-strategy
“Traditional job creation metrics are insufficient for evaluating startup ecosystem success.”
youtube / swyx / 2d ago / failed
youtube / swyx / 2d ago
OpenAI is leveraging large language models (LLMs) to achieve highly autonomous software development. Their approach focuses on creating an AI-native environment where agents write, test, and even review code with minimal human intervention. This strategy significantly accelerates development cycles and fundamentally redefines traditional software engineering roles, enabling a small team to manage an extremely large codebase.
ai-agentssoftware-development-lifecyclellm-engineeringharness-engineeringdeveloper-productivityproduct-developmentautonomous-agents
“AI agents can develop software 10 times faster than human engineers by generating millions of lines of code and thousands of pull requests with minimal human oversight.”
tweet / @swyx / 3d ago
The discussion highlights a notable difference in the self-evident nature of technical terms, contrasting "prompt injection" with "lethal trifecta." The core insight revolves around how clearly a term's meaning is conveyed through its name, which significantly impacts its adoption and understanding within a technical community. This distinction is critical for effective communication and education in rapidly evolving fields, particularly in AI/ML safety.
prompt-injectionllm-securityai-safetyai-ethics
“The term 'prompt injection' is considered to have clear and self-evident naming.”
tweet / @swyx / 3d ago
The AI Engineer Summit website, ai.engineer, requires an update to its Open Graph (OG) image. This is a critical task to ensure proper social media previews and branding, as indicated by a direct request from a user referencing the site.
twitter-feedsocial-media-analysisai-engineerog-imagehourly-polluser-generated-content
“The AI Engineer Summit website's Open Graph image needs an update.”
tweet / @swyx / 3d ago
Swyx, a prominent figure, is currently investigating an undisclosed topic, indicating an ongoing inquiry. The lack of specific details prevents further analysis of the subject matter or potential implications. This suggests a potential future announcement or development worthy of monitoring.
swyx-feedsocial-mediahourly-pollinvestigation
“Swyx is currently investigating something.”
tweet / @swyx / 3d ago
Amazon S3 Files now offers fully-featured, high-performance file system access, differentiating it as the first and only cloud object storage with this capability. This innovation addresses the historical gap in cloud storage by providing traditional file system functionalities alongside object storage benefits, potentially streamlining data management for developers and enterprises. The announcement directly follows discussions around the lack of open-source cloud storage solutions with similar features, suggesting a timely market entry.
awscloud-storageamazon-s3cloud-infrastructureobject-storage
“Amazon S3 Files is the first and only cloud object store to offer fully-featured, high-performance file system access.”
tweet / @swyx / 4d ago
OpenAI's "Extreme Harness Engineering" initiative, as discussed in the Latent Space podcast, demonstrates a significant advancement in autonomous software development. This approach, exemplified by projects like Frontier and Symphony, enables the generation and daily processing of massive codebases (1M LOC, 1B tokens/day) with zero human involvement in both code creation and review. The methodology effectively creates "software factories" that operate without direct human intervention, raising implications for future software development paradigms.
openaiextreme-harness-engineeringfrontier-symphony-platformllm-infrastructurecode-generationsynthesis
“OpenAI's 'Extreme Harness Engineering' enables fully autonomous code generation.”
tweet / @swyx / 4d ago
A recent podcast episode explores the personal and financial lives of individuals who have amassed significant wealth through cryptocurrency tokens. The discussion aims to provide insights into the experiences and perspectives of these "token billionaires."
podcastweb3cryptocurrencytwitter-spacesbillionaires
“A podcast episode is available that discusses the lives of individuals who have become billionaires through cryptocurrency tokens.”
tweet / @swyx / 4d ago
This content highlights the use of humor, specifically encouraging costumes, and direct engagement with "customers" as a strategy for social media interaction. The intent appears to be fostering a more playful and interactive environment on platforms like X (formerly Twitter). This approach aims to boost user participation and brand likeability through non-traditional methods.
marketing-strategycustomer-engagementsocial-mediapromotional-activity
“Encouraging costumes is a method to engage an audience on social media.”
tweet / @swyx / 4d ago
The user, Swyx, humorously notes the unusual request for "lobsters" at an AI conference. This highlights a potential disconnect between typical conference expectations and a more whimsical or unexpected element being introduced, likely for comedic effect or a unique experience. The "poor guy" he briefed is probably a conference organizer or event planner.
ai-conferencestwitter-trendssocial-mediaevent-planningai-community-humor
“Swyx had to explain the desire for lobsters at an AI conference.”
tweet / @swyx / 4d ago
Swyx, a prominent figure, has activated "LOBSTER_BOOTSTRAP" and is engaging in a booth build day for @aidotengineer. The context suggests a public-facing event, potentially a conference or trade show, given the emphasis on a "booth build day."
x-postsai-engineerevent-coveragebehind-the-scenes
“Swyx has activated something called 'LOBSTER_BOOTSTRAP'.”
tweet / @swyx / 4d ago
An AI Engineer event is scheduled to take place in Singapore. The event is likely a networking or community gathering, given the invitation to "meet them," and prominently features a specific URL for registration or further information: ai.engineer/sg. This suggests a direct call to action for individuals interested in AI engineering in the region.
event-promotionai-communitynetworkingsingapore-aideveloper-event
“An AI Engineer event is being held in Singapore.”