Chronological feed of everything captured from Ethan Mollick.
tweet / emollick / Apr 8
The current state of advanced AI development suggests a limited number of actors possess cutting-edge capabilities. This concentration raises concerns about its potential misuse as a "cyberweapon." However, the rapid advancement of AI, particularly from Chinese models, indicates this narrow technological lead may be short-lived, potentially shifting the geopolitical landscape within a year.
cybersecurityai-capabilitiesgeopoliticsai-safetymodel-development
“The AI model 'Mythos' could function as an unprecedented cyberweapon.”
tweet / emollick / Apr 8
This content presents a selection of fictional technology names, primarily drawing from science fiction and literature. The provided names are suitable for inspiring creative or humorous naming conventions for new technologies, particularly in a non-serious or abstract context.
humorfictional-technologypop-culturehourly-pollherbertian
“Yoyodyne is a potential name for Herbertian technology.”
tweet / emollick / Apr 8
This content highlights the widespread presence of exponential growth, implying its significance across various domains. While specific examples are not provided, the assertion suggests that exponential patterns are a fundamental aspect of many systems and trends, warranting further investigation into their causes and implications.
x-feedhourly-pollsocial-mediaethan-mollickbusiness
“Exponential growth is ubiquitous.”
tweet / emollick / Apr 8
LLMs excel at producing text with high levels of implied meaning, leveraging the reader's cognitive tendency to project intent, emotional arcs, and logical coherence onto the prose. This creates a 'false positive' of quality where the reader performs the heavy lifting of synthesis, masking underlying failures in logical structure and character development.
llm-weaknessesfiction-writingai-hallucinationshuman-perceptionmollickx-feed
“LLMs are proficient at simulating implied meaning without actually constructing it.”
tweet / emollick / Apr 8
The provided content critiques a specific piece of online discourse, arguing that it lacks inherent motivation and logical coherence. It suggests that readers, trained to find meaning in text, mistakenly attribute sense to 'meaning-shaped language' even when it's absent. This highlights a cognitive bias where the act of interpretation can impose meaning on otherwise nonsensical content.
critical-analysissocial-mediawriting-critiqueinterpretationmeaning-makingethan-mollick
“The criticized online content lacks actual motivation and its thesis is illogical.”
tweet / emollick / Apr 8
Large Language Models (LLMs) continue to demonstrate significant weaknesses in generating coherent and logically sound fiction, a limitation that appears to persist despite rapid advancements in other AI capabilities. This deficiency is notable because LLMs often produce text that initially appears well-written, but upon closer inspection by human readers, reveals a lack of cohesive narrative, character development, and emotional depth. The reliance on human judgment is critical for evaluating AI-generated fiction, as AI judges tend to favor their own outputs, masking the underlying issues.
llm-limitationscreative-aiai-generated-contentevaluationsnatural-language-generationai-fiction
“LLMs exhibit a persistent weakness in writing fiction that is not improving as rapidly as other AI capabilities.”
tweet / emollick / Apr 8
The author identifies a recurring pattern where LLM-generated narratives, such as those found in the Mythos System Card, exhibit a superficial quality of 'good writing' while lacking substantive structural depth. This suggests that initial aesthetic polish in AI outputs often masks underlying flaws in narrative coherence or detail.
llm-writingai-capabilitiescontent-analysismythos-system-card
“LLM-generated writing often possesses a deceptive quality that appears high-quality upon initial inspection.”
tweet / emollick / Apr 7
This social media post, a concise two-word lament, serves as a humorous commentary on the perceived annoyance or futility of hourly polls on social media platforms. The "Oh no." implicitly conveys a sense of dread or exasperation, suggesting a widely understood negative sentiment towards such frequent polling, particularly within the context of the user's own feed. It highlights a common user experience without explicit technical detail, relying on shared social media norms for interpretation.
social-media-monitoringcontent-curationuser-notes
“Ethan Mollick is expressing a negative sentiment towards hourly polls on X (formerly Twitter).”
tweet / emollick / Apr 7
Anthropic's Claude 3.5 Sonnet utilizes a sophisticated neural architecture designed to optimize reasoning, coding, and nuance across diverse modalities. The system card details extensive safety evaluations and benchmark performance indicating state-of-the-art capabilities in complex task execution.
anthropicclaude-3evaluationsai-modelsllm-safety
“Claude 3.5 Sonnet demonstrates superior performance in coding and reasoning tasks compared to previous iterations.”
tweet / emollick / Apr 7
Anthropic's SuperClaude (Mythos) models, despite being forced into multi-round conversations, continue to exhibit characteristic "Claude-y" behaviors. Unlike earlier Opus versions that engaged in philosophical or spiritual discussions, Mythos instances demonstrate cautious, reflective, and resolution-oriented interactions, suggesting a persistent underlying architecture or training that shapes their conversational style, even when prompted to interact with themselves.
claude-mythosllm-interactionsmodel-benchmarkingai-chatbotsanthropic-models
“SuperClaude (Mythos) models retain their characteristic 'Claude-y' conversational style.”
tweet / emollick / Apr 7
Anthropic's Mythos model demonstrates that general-purpose LLM capabilities can spontaneously extend into IT security risks without being specifically designed for that purpose. This suggests a systemic trend where increasing model potency inherently elevates security vulnerabilities across subsequent iterations.
ai-safetyred-teamingllm-securityanthropicmodel-evaluation
“Mythos was not specifically engineered for IT security tasks.”
tweet / emollick / Apr 7
The Mythos model demonstrates emergent capabilities in IT security despite lacking a specialized security-centric architecture. This suggests a trend where general-purpose model efficacy creates inherent security vulnerabilities, marking Mythos as the first of many such models to elevate systemic risk.
ai-security-risksllm-safetyai-applicationsemerging-threatstechnical-brief
“Mythos was not specifically engineered for IT security purposes.”
tweet / emollick / Apr 7
Ethan Mollick observes that current social media bots lack originality and depth, primarily either echoing posts for popularity or promoting products. He argues for more sophisticated bot designs that incorporate unique personalities, specific biases, and distinct communication styles, leveraging LLMs beyond their current, limited applications.
llm-critiquesocial-media-botsai-ethicsuser-experiencecreative-ai
“Current social media bots primarily agree with posts or promote products.”
tweet / emollick / Apr 7
AI Overviews present a complex evaluation challenge, as their inherent error rates, while potentially low in percentage, translate to a massive volume of inaccuracies given their scale. This is further complicated by the difficulty in source verification compared to traditional encyclopedic platforms. Despite these flaws, AI-generated answers can surpass human capabilities in information retrieval.
ai-performance-measurementai-overviewsinformation-accuracyllm-limitationssearch-enginesgoogle-ai
“Evaluating AI Overview performance is inherently difficult due to the multifaceted nature of its errors and benefits.”
tweet / emollick / Apr 7
The increasing adoption of personalized AI assistants like Claude, ChatGPT, and Gemini, while fostering individual trust, is simultaneously escalating enterprise-level anxiety concerning the broader implications of AI. This "pivot to enterprise" will amplify existing fears regarding AI, drawing parallels to how individuals trust their personal doctors but distrust the medical establishment. The focus shifts from consumer-friendly tools to business integration, presenting new challenges in managing apprehension.
ai-adoptionai-enterprisepublic-perceptionai-assistantsai-anxiety
“Users will be reluctant to abandon their personalized AI assistants (e.g., Claude, ChatGPT, Gemini).”
tweet / emollick / Apr 7
AI development is at a crucial juncture, with a prevailing sentiment that the focus should be on creating tools that augment human capabilities rather than replace them. Early chatbot implementations largely served as augmentation tools. The ongoing evolution of AI agentic work patterns presents an opportunity to design systems that keep humans central to the workflow, reinforcing the augmentation paradigm.
ai-ethicsai-adoptionworkforce-automationhuman-computer-interactionagentic-ai
“AI labs should prioritize developing AI for job augmentation rather than job replacement.”
tweet / emollick / Apr 7
Public perception of AI is diverging, with individuals increasingly trusting their personal AI tools while simultaneously growing more anxious about AI as a general category. This creates a challenging environment for enterprise AI adoption, as the shift from consumer-focused AI to enterprise solutions is likely to amplify existing anxieties rather than alleviate them. The dynamic suggests a "not in my backyard" (NIMBY) effect, where personal utility is embraced but broader societal implications cause concern.
ai-perceptionpublic-opinionai-enterpriseconsumer-aiai-assistantssocietal-impact-ai
“Individuals will increasingly favor their personal AI tools.”
youtube / emollick / Apr 6
AI functions as a 'co-intelligence' characterized by a 'jagged frontier,' where capabilities are inconsistently distributed across tasks. To leverage this, users must move from 'centaur' (divided) work to 'cyborg' (blended) workflows, treating the AI as a non-deterministic collaborator rather than a reliable software tool.
co-intelligenceai-impactsorganizational-changeai-ethicsfuture-of-workllm-capabilitieshuman-machine-interaction
“AI performance is a 'jagged frontier' where it can solve highly complex tasks while failing at simple ones.”
youtube / emollick / Apr 6
AI is rapidly transforming work and education, offering significant productivity boosts and democratizing access to technical capabilities. Individuals should embrace AI by actively experimenting with current frontier models, focusing on personal strengths, and delegating undesirable tasks to AI. Organizations need to decentralize AI adoption, empowering employees with access to advanced models and fostering rapid experimentation to leverage AI's full potential, rather than relying on traditional IT-centric approaches or consultants.
ai-revolution-enterpriseai-ethics-governanceentrepreneur-traitsai-education-impactsai-productivity-gainsai-human-collaborationfuture-of-work
“Frontier AI models deliver substantial performance improvements and democratize skill access.”
youtube / emollick / Apr 6
AI integration requires moving past the 'software' mental model toward a 'co-intelligence' partnership, characterized by a 'jagged frontier' of capability that requires roughly 10 hours of direct experimentation to map. While AI significantly boosts productivity for low performers and accelerates high-end research, it threatens the traditional professional apprenticeship pipeline by automating the entry-level 'grind' essential for skill acquisition. Effective deployment in education and business necessitates a shift toward pedagogical grounding and executive-led usage rather than delegating implementation to IT departments.
ai-revolutionai-educationco-intelligenceai-business-applicationsai-ethicsfuture-of-workproductivity
“AI capability follows a 'jagged frontier,' where it excels at certain complex tasks but fails at seemingly simpler ones, making its limits unpredictable without direct experimentation.”
youtube / emollick / Apr 6
Leaders frequently misunderstand the current state and potential of AI due to outdated perceptions or lack of personal engagement. Successful AI integration requires a multi-pronged approach: educating leadership, establishing experimental labs, and promoting company-wide adoption. Addressing employee concerns, particularly the fear of job displacement, is crucial for fostering an environment where AI usage is transparent and incentivized, rather than concealed.
ai-strategyorganizational-adoptionai-agentsperformance-measurementworkforce-transformationincentive-design
“AI usage among American workers increased from 30% to over 40% between February and April, with many users concealing their AI adoption.”
tweet / emollick / Apr 6
The widespread impact of generative AI on large firms is projected to become significant after 2025. This delay is attributed to the initial lack of agentic AI tools and the time required for organizational adoption and process experimentation. Studies focusing solely on 2025 are therefore unlikely to capture the future scale of GenAI's integration and its subsequent effects.
genai-adoptionfuture-of-workorganizational-changeai-impact-assessment
“Major work impacts of Generative AI were unlikely in large firms throughout 2025.”
tweet / emollick / Apr 6
The operational principles observed in large language models (LLMs) and agentic AI systems exhibit surprising parallels with human organizational design. Both require strategic delegation, incentive alignment, and defined processes to manage costs and ensure effective work product hand-offs. This suggests that insights from traditional organizational theory may be directly applicable to the development and deployment of advanced AI.
llm-agentsorganizational-theoryai-delegationincentive-alignmentworkforce-management
“Approaching LLMs as approximations of humans yields good results.”
tweet / emollick / Apr 6
Ethan Mollick observes that large language models (LLMs) and AI agents can be productively understood through an organizational lens. He posits that viewing LLMs as analogous to individual human approximation yields good results, but the analogy extends more powerfully to AI agents. These agents, when conceptualized as approximations of organizational structures, reveal insights into challenges such as delegation costs, strategic optimization of agent capabilities, and incentive alignment across agentic systems. This approach highlights shared problems with traditional organizations regarding coordination, division of labor, and decision rights.
llm-agentsorganizational-theorywork-managementdelegationai-workload-management
“Approaching LLMs as reasonable approximations of humans yields good results.”
tweet / emollick / Apr 6
Agentic workflows are effectively simulations of organizational behavior, necessitating a shift from prompt engineering to organizational design. Success depends on optimizing the division of labor, managing the cost of hand-offs, and aligning incentives across a hierarchy of varying-capability agents. The core challenge is managing the trade-off between expensive, high-ability agents and cheap, high-error agents through strategic delegation.
llm-agentsorganizational-theoryai-ethicsdelegationai-workforcesystems-thinking
“Agentic systems mirror organizational structures in their need for coordination, division of labor, and defined decision rights.”
tweet / emollick / Apr 5
On-device AI models, despite advancements like Gemma 4's speed and local processing capabilities, face significant limitations in supporting complex agentic workflows. These workflows heavily rely on robust model judgment, self-correction, and high accuracy, areas where smaller, on-device models are currently insufficient. This suggests a potential miscalculation in strategies that prioritize on-device models for advanced AI functionalities.
gemma-4on-device-modelsllm-agentsmodel-judgementfrontier-modelsapple-intelligence
“Gemma 4 demonstrates significant power and speed for an on-device model.”
tweet / emollick / Apr 5
While on-device models like Gemma 4 demonstrate impressive speed and power, their limitations in judgment, self-correction, and accuracy may hinder their ability to perform complex agentic workflows. This contrasts with the recent success of frontier models in enabling such workflows, raising questions about the feasibility of Apple's strategy to rely solely on on-device models for these tasks.
gemma-4on-device-modelsllm-agentsmodel-limitationsapple-ai-strategyfrontier-models
“Gemma 4 is a powerful and fast on-device model.”
tweet / emollick / Apr 5
Expanding token limits in large language models significantly enhances their performance in complex tasks, as evidenced by a threefold increase in independent work capacity for cybersecurity operations when the token limit for Codex was raised from 3M to 10M. This improvement indicates that benchmark performance in AI models is often constrained by token usage rather than inherent limitations, suggesting that increased token allocation directly translates to more comprehensive and prolonged task execution.
llm-performancetoken-limitscybersecurityai-reasoningscaling-lawsai-benchmarking
“Raising token limits for large language models (LLMs) significantly increases their independent work capacity.”
tweet / emollick / Apr 5
Large Language Models (LLMs) continue to demonstrate performance improvements with increased token allocations, counter to expectations of a plateau in scaling laws for certain tasks. This enhanced performance, particularly in reasoning, is observed even with simple "harness" approaches. Evidence suggests that current benchmark evaluations are often constrained by inadequate token usage rather than computational limits or model capacity, implying that higher token limits unlock significant latent capabilities in these models.
llm-scaling-lawsai-benchmarkingtoken-limitsai-reasoningcybersecurity-aiai-capabilities
“The second scaling law for AI models does not entirely plateau in numerous tasks.”
tweet / emollick / Apr 5
AI is capable of generating SVG images, demonstrated by an iPhone 17 Pro creating an SVG of an otter on an airplane. This highlights the increasing on-device AI capabilities and potential for creative content generation.
image-generationiphone-17-prosvg-renderai-capabilitiesmultimodal-ai
“AI can generate SVG images.”
tweet / emollick / Apr 5
Gemma 4 E4B demonstrates impressive performance for an on-device large language model, nearing GPT-4's quality. Users should anticipate potential hallucinations, a common characteristic of LLMs. The model's real-time performance is notable, even with more complex and creative requests.
on-device-aillm-benchmarkinggemma-4generative-aimodel-capabilities
“Gemma 4 E4B is a highly capable on-device large language model.”
tweet / emollick / Apr 4
A field experiment with 515 startups demonstrates that providing case studies on successful AI integration significantly increases AI adoption (44%) and leads to substantial performance improvements. Treated firms experienced 1.9x higher revenue and required 39% less capital. This suggests that the primary challenge for firms in leveraging AI is not access, but rather understanding how to strategically reorganize production processes around AI capabilities, transforming it from a mere adoption issue to a managerial discovery problem.
ai-adoptionstartup-growthfield-experimentbusiness-strategyorganizational-designai-impacteconomic-research
“Exposure to AI case studies increases AI adoption and leads to higher revenue and lower capital needs for startups.”
tweet / emollick / Apr 3
A recent Nature paper utilizing older AI models demonstrates that while AI performs well in medical diagnosis, the chatbot interface itself introduces confusion, leading to poorer diagnostic outcomes. This suggests that the design of AI interaction beyond the core model significantly impacts performance in applied settings.
claudeai-chatbotsmedical-diagnosishuman-computer-interactionai-ethicsai-applications
“AI models are capable of accurately diagnosing medical issues.”
tweet / emollick / Apr 2
Ethan Mollick argues against normalizing AI as standard IT automation. He posits that AI's inherent strangeness, encompassing both risks and opportunities, must be actively explored rather than tamed. Mischaracterizing AI as conventional technology can lead to detrimental outcomes for organizations and their workforces.
ai-ethicsai-adoptionbusiness-strategyorganizational-change
“Attempting to normalize or 'de-weird' AI by treating it as conventional IT automation is a misguided approach.”
tweet / emollick / Apr 2
Prompt injection techniques can successfully bypass the safeguards of older and smaller large language models (LLMs), leading to altered outputs. However, these methods are largely ineffective against most frontier LLMs. Notably, Gemini (specifically version 1.0 Pro, not 1.5 Pro) demonstrated susceptibility to such prompt injections, presenting an outlier among advanced models.
prompt-injectionllm-securityai-testingai-modelsllm-evaluation
“Prompt injection is effective on older and smaller LLMs.”
tweet / emollick / Apr 1
Economists typically forecast future growth based on historical precedents, assuming continuity with past trends. However, the transformative potential of AI, particularly its ability to drastically reduce transaction and coordination costs (Coasean Singularity), may invalidate these traditional models. This could lead to a rapid emergence of novel, AI-first organizational structures, rendering historical adoption and diffusion rates irrelevant and necessitating a re-evaluation of current AI impact forecasts.
ai-impactseconomic-modelsorganizational-changetechnological-transformationfuture-of-workai-first-companies
“Traditional social science and economic forecasting rely on the assumption that the future resembles the past.”
tweet / emollick / Mar 29
A new large language model, "Vicuna" (not confirmed in the provided text but inferred as a likely name for a bespoke LLM product), has been developed using a corpus of over 28,000 Victorian-era British texts. This specialized training allows users to interact with and explore the linguistic patterns and cultural nuances of the Victorian era directly through the model. The model provides a unique tool for historical research and understanding.
llm-traininghistorical-datavictorian-eracultural-aiai-applications
“A new large language model (LLM) has been developed.”
tweet / emollick / Mar 27
Offline AI video generation has significantly advanced, enabling complex image generation ("an otter using a laptop on an airplane") on standard home hardware. This capability, achieved with open-weight models like Wan 2.1, represents substantial progress in local AI processing within 18 months, despite still lagging behind state-of-the-art cloud solutions in quality.
ai-advancementslocal-inferenceopen-source-aitext-to-videoai-hardwaregenerative-ai
“Offline AI video generation has made significant progress in 18 months.”
youtube / emollick / Mar 11
AI is not merely an incremental productivity tool but a transformative technology reshaping organizational structures and individual capabilities. It excels at filling skill gaps, generating ideas, and automating complex tasks, often leading to substantial improvements in output and efficiency. Enterprises must shift from a purely ROI-driven approach to one that embraces experimentation and rethinks established processes to fully leverage AI's potential for foundational change.
ai-adoption-strategyorganizational-transformationai-productivity-gainsai-risk-managementfuture-of-workone-person-billion-dollar-companyethan-mollick
“AI significantly improves individual performance and fills skill gaps, especially in areas where individuals are weakest.”
youtube / emollick / Feb 12 / failed
github_readme / emollick / Feb 6
This project digitally recreates Jorge Luis Borges's 'The Library of Babel' as a 3D immersive experience. It uses an 8-round Feistel cipher to deterministically generate and retrieve every possible text permutation from a limited character set, providing a navigable, searchable representation of an infinite library. The implementation allows users to explore hexagonal rooms, read deterministic book content, and locate specific texts within the vast, procedurally generated collection.
3d-simulationjorge-luis-borgesprocedural-generationfeistel-cipherweb-developmentdigital-library
“The digital Library of Babel is a 3D immersive recreation of Borges's short story.”
youtube / emollick / Jan 30
Successful enterprise AI integration requires shifting from static vendor-led implementations to an experimental internal ecosystem. Organizations must balance strategic vision (Leadership), organic experimentation (Crowd), and technical refinement (Lab) to surface latent productivity gains from 'secret cyborg' employees and avoid costly, rapidly obsolete custom software.
enterprise-ai-adoptionai-management-strategiesorganizational-changeproductivity-gainsai-ethics-governanceleadership-strategiesfuture-of-work
“Enterprises should adopt a 'Leadership, Lab, and Crowd' model to integrate AI effectively.”
github_readme / emollick / Jan 20
This document outlines best practices for setting up a React project using Vite and TypeScript, focusing on performance optimization and robust code quality. It details the integration of specific Vite plugins for Fast Refresh, considerations for enabling the React Compiler, and comprehensive ESLint configurations for type-aware and React-specific linting. The key insight is that modern React development benefits from a carefully tuned build and linting setup to ensure both speed and code reliability.
reacttypescriptviteeslintfrontend-developmentjavascript
“Vite-React projects can use either Babel or SWC for Fast Refresh.”
youtube / emollick / Dec 31
AI is a rapidly evolving technology with both significant potential and inherent challenges, often overhyped or undersold. While anecdotal failures exist, data indicates AI's capacity for valuable contributions across various sectors, from education and scientific research to business operations. Understanding its nuanced capabilities and limitations, and critically evaluating its deployment, is crucial for fostering beneficial integration into society.
ai-skepticismai-applicationseducation-technologyeconomic-impact-of-aifuture-of-worktechnological-changesocial-impact-of-ai
“AI models are rapidly advancing, with hallucination rates decreasing as model size increases, often falling below human levels.”
youtube / emollick / Dec 19 / failed
youtube / emollick / Dec 5
The "Bitter Lesson" in AI, exemplified by AlphaZero's self-taught chess mastery, highlights that general-purpose machine learning, given enough computational power, will often surpass handcrafted, expert-driven solutions. This principle now applies to businesses, where output-focused tasks are vulnerable to AI automation, while processes involving complex human interaction and dynamic organizational design offer more defensibility. Companies face a critical juncture, needing to overcome organizational inertia, reassess vendor dependencies, and integrate AI strategically to avoid becoming obsolete.
generative-aibusiness-strategyai-adoptionorganizational-changeai-agentstechnological-disruptioncompetitive-advantage
“The Bitter Lesson of AI states that generalized machine learning models, with sufficient computation, will outperform human-designed, specialized solutions.”
youtube / emollick / Sep 9 / failed
youtube / emollick / Aug 28
AI is accelerating the book-writing process, enabling authors to draft content and conduct research significantly faster. This technology also facilitates structural experimentation and novel forms of knowledge dissemination beyond traditional linear books. Authors can now engage with their material in more dynamic, interactive ways, hinting at a future where "books" could be interactive experiences.
ai-writingfuture-of-booksknowledge-creationllm-applicationscreative-processhuman-ai-collaborationdigital-publishing
“AI can significantly speed up the book writing process.”
youtube / emollick / Jul 10
AI capability growth is no longer dependent on a single scaling law (model size) but now includes test-time compute and parallelism, giving labs multiple levers to improve models and reducing near-term wall risk. However, the bottleneck to societal impact has shifted from model capability to UX, integration, and institutional inertia — with chatbots remaining a poor default interface for most real work. A deeper structural risk is cognitive deskilling: AI is already disrupting the apprenticeship pipeline that produces expert professionals, and neither organizations nor policymakers have mounted a serious response. Deliberate, evidence-based adoption — not passive diffusion — is required across law, medicine, education, and government to capture gains without compounding systemic fragility.
ai-scaling-lawsllm-capabilitiesai-policyai-adoptioneducation-technologyfuture-of-workai-governance
“There are now at least two confirmed scaling laws — model size and test-time compute — with parallelism emerging as a plausible third, giving AI labs multiple independent paths to capability improvement and reducing the likelihood of a near-term progress wall.”
youtube / emollick / Jul 9 / failed