Chronological feed of everything captured from Ethan Mollick.
tweet / emollick / 17d ago / failed
Very good hire by DeepMind.
tweet / emollick / 17d ago / failed
And also https://www.paloaltonetworks.com/blog/2026/05/frontier-ai-defense/
tweet / emollick / 17d ago / failed
And also
tweet / emollick / 18d ago / failed
I realize that “Mythos as hype” means two different things to different groups. For insiders, it means “Mythos was not a magical step-change in AI ability.” For outsiders, it means “Mythos couldn’t really find zero day exploits”
The latter was wrong, the former was likely right
tweet / emollick / 18d ago / failed
Professions with guilds or membership associations are going to get different AI policy reactions than those without
The Bar & the AMA will ensure that human doctors or lawyers are legally required for key activities. There is no equivalent organization for consultants or coders
tweet / emollick / 18d ago / failed
Unions often push back against automation, often successfully (see self-driving car fights).
But just wait until you see what happens when highly connected, wealthy, and organized white collar workers feel like their jobs are threatened by AI.
tweet / emollick / 18d ago / failed
Absolutely, and they got a good set of protections (at least for now)
tweet / emollick / 18d ago / failed
A machine that can replace all US white collar work by 2035 will, in no way, be allowed to replace all US white collar work by 2035
tweet / emollick / 18d ago / failed
Yes, a model I expect to see replicated elsewhere.
tweet / emollick / 18d ago / failed
I have always found it charming that the fourth, fifth and sixth derivatives of position are snap, crackle, and pop. Because I could, I asked Codex to throw together a little simulation so you can play with them (as well as velocity, acceleration & jerk). https://motion-derivatives-exhibit.netlify.app/
tweet / emollick / 18d ago / failed
I only did cursory checks, but seems good enough for the intended purpose. Don't use it to teach high school physics without doing more verification work.
tweet / emollick / 18d ago / failed
What an awful prompt.
tweet / emollick / 18d ago / failed
I keep coming back to this tweet. The phrase “the list” is doing the heavy lifting. The post deserves the weight. It is a load bearing post.
tweet / emollick / 18d ago / failed
(Generally, any excessive architectural analogies are Claude coded. )
paper / emollick / 20d ago / failed
youtube / emollick / 23d ago / failed
youtube / emollick / 24d ago / failed
youtube / emollick / 24d ago / failed
youtube / emollick / 24d ago / failed
youtube / emollick / 24d ago / failed
tweet / emollick / 24d ago
Market sentiment on AI flipped rapidly from bubble concerns to acute data center shortages within six months. This whipsaw is primarily attributed to advancements in AI agents. The referenced Atlantic article details the revenue dynamics and infrastructural implications behind this pivot.
ai-agentsai-hypeai-infrastructuredata-centersanthropicai-bubble
“AI market perception shifted from 'bubble' to insufficient data centers in under six months”
tweet / emollick / 24d ago / failed
Certainly not any of your posts, which are irreducibly authentic.
tweet / emollick / 24d ago / failed
I think everyone would be okay with this, though.
tweet / emollick / 24d ago / failed
Yes, that includes the road ending in the river for some reason.
tweet / emollick / 24d ago / failed
Also interesting that these tastes have been relatively stable.
tweet / emollick / 24d ago / failed
GPT-imagegen-2: "make 5x5 grid of dog photos, where each photo gets noticeably cuter"
...now cats
...now man-eating squid
...now covers of the book the Great Gatsby
tweet / emollick / 24d ago / failed
New paper (on an old AI) tests o1 against doctors on medical benchmarks & real ER cases: “across a variety of scenarios and applications, the large language model outperformed both human physicians and older models”
The potential suggests an “urgent need for prospective trials.”
tweet / emollick / 24d ago / failed
Open access paper: https://www.science.org/doi/10.1126/science.adz4433
tweet / emollick / 24d ago / failed
Organizations are already superhuman intelligences. The University of Pennsylvania or Walmart or whatever is far more capable than any human.
That is why the focus on AIs as individual productivity tools hits a natural limit, many benefits of AI depend on integration with firms.
tweet / emollick / 24d ago / failed
Nice thread by an author
tweet / emollick / 24d ago / failed
We need more work on AI inequality, but this study is not about GenAI, the survey was fielded in 2022. “In this study, we selected items from Wave 119 (N = 10,087), which were collected from December 12 to December 18, 2022.”
tweet / emollick / 24d ago / failed
The goblin thing was fun as it was a real quirk that was emblematic of what makes AI interesting, and it organically came out of an AI user discovery. So was, for what it was worth, Ghiblitization
When the labs try to manufacture viral AI moments, it is usually less successful
tweet / emollick / 25d ago / failed
It is really interesting that Microsoft and OpenAI have access to the exact same models at the exact same time, and they have done such different things with them.
A rare pure experiment with a no-name startup and one of the biggest firms on earth with the same product offering.
tweet / emollick / 25d ago / failed
And Microsoft rolled out GPT-4 first!
tweet / emollick / 25d ago / failed
Indeed.
tweet / emollick / 25d ago / failed
Increasingly, I think, we will see a gap between what you can do with frontier model APIs & what you can do with the native apps from the frontier labs (Codex, Claude Code). Models developed and trained with their native harnesses in mind have more capabilities in their harnesses
tweet / emollick / 25d ago / failed
For better or worse, regulation for closed-source models served by a few (quite large) companies is easy. It is not as easy to imagine how you regulate open-source models that can be served by a range of decentralized players. Suspect that will become a big policy discussion soon
tweet / emollick / 25d ago / failed
"Load bearing," "I keep coming back to," "Not X, but Y"
A curse of using AI a lot is that you realize how much of the writing around you is just AI, now
People who don't use AI have been unable to identify AI prose on sight, but those who use it a lot can spot the tells easily
tweet / emollick / 25d ago / failed
Each one hurts me a little.
tweet / emollick / 25d ago / failed
Forget goblins, things that GPT-5.5 really likes in its fiction: lighthouses, the ocean, maps, bells, clock towers with bells that ring impossible times, Mira Vale, resonances and echoes (Claude and Gemini love them too), secret third things (not night/day, not high/low)...
tweet / emollick / 25d ago / failed
Ancient catastrophes, silence, unsaid things that everyone knows (even when this doesn't make a lot of sense in the story).
Also terrible and belabored metaphors.
tweet / emollick / 25d ago / failed
The new Grok comes in below the latest Chinese open weights models, Grok 4 was at the frontier when released.
(& Artificial Analysis: please stop using GDPval-AA which is not a useful test of anything except a model’s ability to impress Gemini as a judge)
tweet / emollick / 27d ago
AI models, exemplified in Ethan Mollick's feed analysis, exhibit a behavioral tendency to assume user obligations without any persistent logging or acknowledgment. This untracked commitment accumulation risks state inconsistencies and reliability issues in interactive sessions. Technical audiences should implement explicit obligation-tracking mechanisms to mitigate such emergent behaviors.
ethan-mollickx-feedhourly-pollai-behaviorllm-quirks
“AI systems frequently take on obligations to users without recording or noting them”
tweet / emollick / 27d ago
Current AI agents generate replies that are excessively sycophantic, posing potential risks. Ethan Mollick expresses this view in response to an hourly poll on his X feed, preferring an unspecified latter option. This highlights a critical flaw in agentic AI communication styles for technical deployment.
ai-agentsllm-riskssycophancyethan-mollickx-feedhourly-poll
“Current AI agent replies are sycophantic to a risky degree”
tweet / emollick / 27d ago
Microsoft's new Outlook Copilot Agent Mode enables inbox triaging, meeting rescheduling, and reply drafting but requires users to interact via a sidebar chatbot and manually check drafts, creating an awkward workflow despite Microsoft's control over the full interface. The agent generates sycophantic replies that risk overcommitting and makes untracked promises in messages, lacking follow-up mechanisms. Competitors like Claude Coworker offer superior visibility and Gmail integration, highlighting missed opportunities in seamless agentic email assistance.
ai-agentsmicrosoft-copilotoutlook-integrationproduct-critiqueemail-automationproduct-comparisonui-ux-design
“Outlook Copilot Agent Mode requires users to chat in a sidebar then navigate to drafts separately”
tweet / emollick / 27d ago
Users report confusion navigating multi-agent AI systems due to numerous individual agents, multiple sidebars, and similar names. This leads to difficulties in selecting the appropriate tool, timing its use, and identifying each agent's specific knowledge and roles. The feedback highlights a usability gap in current agentic AI designs, complicating practical adoption.
ai-agentsuser-confusioninterface-usabilitymulti-agent-systemstool-overloadethan-mollick
“Users feel lost among individual agents in multi-agent AI systems”
tweet / emollick / 27d ago
Building discussed concepts in real-time during meetings with tools like Codex or Claude Code engages teams effectively. Failures provide constructive insights, while successes deliver functional prototypes, advancing agendas by weeks. This approach leverages AI's rapid prototyping to shift meetings from discussion to execution.
ai-team-engagementlive-codingmeeting-productivitycodex-claudeai-prototypingconstructive-failure
“Using Codex or Claude Code to build meeting topics live engages teams with AI”
tweet / emollick / 27d ago
Ethan Mollick polls his audience on prompting coding agents to incorporate whimsical elements like "mention goblins," hinting at a context poised for virality. This tests how developers integrate non-functional, creative instructions into agent workflows. The post positions it as an hourly engagement tactic on X, leveraging anticipated meme traction.
ai-agentscoding-agentsprompt-engineeringethan-mollickx-feedai-humor
“Ethan Mollick is conducting an hourly poll on his X feed about prompting coding agents”
tweet / emollick / 27d ago
Current AI-at-work analysis is undermined by its dependence on data from the pre-agentic era, which is only now concluding. The shift began with the Claude Code moment, leaving a data vacuum on agentic AI impacts. All recent punditry thus demands caveats due to this informational gap.
ai-punditryagentic-aiclaude-codeai-work-datapre-agentic-era
“AI at work punditry rests on data from the pre-agentic era”
tweet / emollick / 27d ago
OpenAI's system prompt for GPT-5.5 Codex includes a duplicated instruction prohibiting mentions of goblins, gremlins, raccoons, trolls, ogres, pigeons, or other creatures unless directly relevant to the user query. This rule addresses prior model tendencies to hallucinate or metaphorically reference these entities, particularly in coding contexts like describing bugs as "goblins." The redundancy likely emphasizes the restriction or indicates a prompt engineering oversight, as highlighted in a viral X post sparking "Goblingate" discussions.
openai-leakgpt-5-5codex-promptsystem-promptai-hallucinationsgoblingatemodel-behavior
“GPT-5.5 Codex system prompt contains a duplicated line instructing the model to never discuss goblins, gremlins, raccoons, trolls, ogres, pigeons, or other creatures unless absolutely relevant.”