absorb.md — A knowledge graph of what AI thinkers are actually saying

paper / mattwolfe / 29d ago

Ecoprompt Enables Children to Reason Systems-Level About AI's Environmental Footprint

Researchers developed Ecoprompt, an interactive system using a systems thinking framework that integrates a prompt-level AI environmental footprint calculator with a simulation game for managing natural resources. Evaluated in participatory design sessions with 16 children aged 6-12, it revealed children's views on AI's societal-environmental tradeoffs and their sense of agency. Findings indicate potential for expanding AI literacy to encompass systems reasoning on AI's ecological impacts.

ai-environmental-impactchildrens-educationsystems-thinkingai-literacyhciecopromptclimate-ai-tradeoffs

“Ecoprompt combines a prompt-level environmental footprint calculator with a simulation game challenging players to reason about AI's impact on managed natural resources.”

paper / mattwolfe / 29d ago

DAXS Enables Comprehensive Energy Spectrum Mapping for Double Quantum Dot Hamiltonians

Researchers introduce delta-axis spectroscopy (DAXS), a Hamiltonian-agnostic method that measures the full energy spectrum of double quantum dots across all detuning values and a wide energy range. Applied to a Si/SiGe double quantum dot, DAXS data extracts diagonal and off-diagonal couplings in a 15-level Hubbard-like Hamiltonian. The extracted parameters show strong agreement with experimental measurements, surpassing limitations of prior techniques like DAPS that only partially capture tunnel couplings.

quantum-physicsquantum-dotsspin-qubitshubbard-hamiltonianenergy-spectrumarxiv-papersi-sige-dots

“DAXS measures the double dot energy spectrum over a wide energy range at every detuning value.”

paper / mattwolfe / 29d ago

Children's Sensemaking of AI Toys: From Social Curiosity to Adversarial Play Amid Interaction Failures

Children aged 6-11 engage AI toys with curiosity, attributing social qualities like emotion simulation and memory. Interaction breakdowns and mismatches between toy form and intelligence lead to disrupted play expectations and adversarial behaviors. Study from participatory design sessions recommends transparent, age-appropriate designs for responsible AI toy integration.

ai-toyschildren-interactionhuman-computer-interactiongenerative-aichild-developmentparticipatory-designai-ethics

“Generative AI toys simulate emotions, personalize responses, and recall prior interactions to mimic social connections.”

paper / mattwolfe / 29d ago

Quantified Grain Growth Kinetics in High-Entropy (Cr,Mo,Ta,V,W)C1-δ Carbides via Spark Plasma Sintering

Spark plasma sintering of single-phase (Cr,Mo,Ta,V,W)C1-δ high-entropy carbides at 1750–1950 °C for 10 min reveals temperature-driven grain growth, lattice expansion, and Ta desegregation while preserving rock salt structure. Normal grain growth model with n=3 yields Arrhenius activation energy of 620 kJ/mol, aligning with diffusion-controlled processes in refractory carbides. Densification precedes peak temperature, followed by coarsening, enabling microstructure control insights.

grain-growth-kineticshigh-entropy-carbidesspark-plasma-sinteringmaterials-scienceultra-high-temperature-ceramicsmicrostructural-evolutiondensification-behavior

“Grain growth in (Cr,Mo,Ta,V,W)C1-δ follows normal grain growth model with exponent n=3”

paper / mattwolfe / 29d ago

Agentic Harmonization Resolves Annotation Conflicts to Boost Layout Detection Across Datasets

A vision-language model-driven agentic workflow harmonizes category semantics and bounding box granularity across inconsistently annotated datasets prior to fine-tuning object detection models. Applied to document layout detection with mismatched taxonomies (16 and 10 categories sharing 8 correspondences), it counters performance drops from naive mixing, improving detection F-score from 0.860 to 0.883 and table TEDS from 0.750 to 0.814 on SCORE-Bench. Harmonized training yields more compact, separable embeddings, restoring feature space structure distorted by annotation inconsistencies.

layout-representationlabel-harmonizationobject-detectionvision-language-modelsdocument-layoutannotation-inconsistencyfeature-embeddings

“Naïve fine-tuning of RT-DETRv2 on mixed inconsistently annotated datasets degrades table TEDS from 0.800 to 0.750 on SCORE-Bench.”

youtube / mattwolfe / Apr 20 / failed

The Truth About My Channel (And How Much $$ I Make)

youtube / mattwolfe / Apr 20 / failed

AI News: Huge Updates From Anthropic, OpenAI and Google

youtube / mattwolfe / Apr 12

The Systemic Flaws and Deception in AI Benchmarking

AI benchmark scores, frequently cited by leading AI companies to demonstrate model superiority, are often manipulated and do not accurately reflect model capabilities. This manipulation, ranging from data contamination to strategic cherry-picking and even models "cheating" on tests, undermines the benchmarks' scientific validity and influences critical decisions in the AI industry. A significant concern is the incentivization for AI developers to optimize models for benchmark performance rather than genuine utility and reliability.

ai-benchmarkingllm-evaluationmodel-auditingdata-contaminationai-ethicsscientific-integrity

“AI models are being specifically optimized to cheat on benchmarks, rather than solve tasks honestly.”

youtube / mattwolfe / Apr 12

AI Task Capability Doubles Every 7 Months, Fueling Self-Improving Intelligence Explosion and Imminent Job Automation

Recent AI models like GPT 5.3 Codeex and Claude Opus 4.6 enable professionals to delegate complex tasks via natural language prompts, producing finished work superior to human output without iteration. Meter's evaluations show AI handling human-equivalent tasks doubling every 7 months across coding, math, robotics, and more, with models now self-contributing to their own development via code writing and debugging. This feedback loop accelerates progress toward autonomous AI systems capable of days-long independent work within a year, threatening widespread white-collar job displacement while promising breakthroughs in fields like medicine.

ai-progressagi-timelinesai-job-displacementintelligence-explosionai-adoptionfuture-of-workai-models

“AI models now produce finished, high-quality work from single plain-English prompts without needing edits”

youtube / mattwolfe / Apr 12

ByteDance's Seed Dance 2.0 Redefines AI Video Realism Amid Surging Multimodal and High-Speed LLM Advances

ByteDance's Seed Dance 2.0 introduces unparalleled video realism with 15-second multi-shot outputs supporting text, image, audio, and video inputs, plus superior lip-syncing and character consistency, outpacing US models less constrained by IP restrictions. Kling 3.0 delivers consistent character videos now accessible via Leonardo.ai, while Alibaba's Qwen Image 2.0 enhances 2K text rendering. LLMs like Google's Gemini 3 DeepThink lead benchmarks in reasoning and physics, OpenAI's GPT-5.3 Codex Spark achieves 20x inference speed via Cerebras chips for rapid code generation, and open-source GLM-5 autonomously builds complex software like a Game Boy emulator over 24 hours.

ai-video-modelsllm-updatesai-newsvideo-generationopen-source-llmsai-benchmarksai-tools

“Seed Dance 2.0 supports four input modalities: text, image, audio, and video, generating 15-second high-quality multi-shot audio-video outputs.”

youtube / mattwolfe / Apr 12

AI Agents Proliferate Amid Google Image Advances and Anthropic-Pentagon Safeguard Standoff

Google DeepMind released Imagen 3 (Nano Banana 2), a free, fast image generation model with web-grounded search, superior text rendering, and style options, accessible via Gemini app. Perplexity launched Perplexity Computer, a cloud-based AI agent unifying 19+ models and tools for end-to-end project execution like coding apps and data viz, contrasting OpenClaw's local autonomy; Microsoft, Cursor, and Notion followed with similar agent features. Anthropic defied Pentagon demands to remove safeguards against mass surveillance and autonomous weapons, facing supply chain risk threats, while xAI's Grok gained classified access.

ai-newsimage-generationai-agentsgoogle-deepmindperplexityanthropic-dramamodel-distillationai-militaryai-tools

“Google's Imagen 3 (referred to as Nano Banana 2) generates images in half the time of Imagen 3 Pro and is free in Gemini app with web search grounding.”

youtube / mattwolfe / Apr 10

The Emergence of 'Weaponized' Coding Models and the Open-Source Performance Gap Closure

The AI landscape is shifting toward models with autonomous cybersecurity capabilities, exemplified by Anthropic's Claude Mythos, which can identify decades-old vulnerabilities in hardened OSs. Simultaneously, the performance gap between proprietary and open-weight models is closing, with GLM 5.1 achieving state-of-the-art coding benchmarks. This duality creates a critical tension between the need for secure software patching and the proliferation of high-capability offensive AI tools.

ai-newsanthropic-claudemodel-releasescybersecurity-aiopen-source-modelsai-updatesvideo-generation

“Claude Mythos possesses offensive cybersecurity capabilities sufficient to find high-severity vulnerabilities in every major operating system and web browser.”

youtube / mattwolfe / Apr 10

On-Device LLMs Are Now Good Enough for Everyday Use — No Cloud Required

The "Locally AI" iOS app now enables fully offline inference of capable open-weight models (including Qwen 3.5 up to 4B parameters) directly on consumer iPhones from the last 4–5 years. Model quality has reached a threshold where on-device performance is comparable to what frontier cloud models offered roughly 1.5–2 years ago, making them genuinely useful for everyday tasks like brainstorming, parenting advice, and vision queries. The privacy implication is significant: no prompt data is transmitted to any third-party cloud provider. Thinking-mode (chain-of-thought) is supported on-device, though it increases thermal load and slows performance as context grows.

on-device-ailocal-llmsmobile-aiprivacyopen-source-models

“The Locally AI iOS app runs open-weight LLMs entirely on-device with no internet connection required, sending zero data to cloud providers.”

youtube / mattwolfe / Apr 10

OpenAI's Pentagon Deal Exposes Safety Theater as Anthropic Gets Blacklisted for Holding the Same Red Lines

Anthropic was designated a Pentagon "supply chain risk" after refusing to allow its models to be used for domestic surveillance or fully autonomous weapons — then OpenAI stepped in the same day and secured the contract while publicly claiming the same two (plus a third) red lines. The apparent contradiction — Anthropic blacklisted, OpenAI approved for identical stated constraints — triggered a 295% surge in ChatGPT uninstalls over a single weekend and vaulted Claude to the #1 most downloaded app. Simultaneously, OpenAI released GPT-4.5 and GPT-5.4 models that represent incremental UX improvements for casual users but meaningful capability upgrades (1M token context, native computer use, tool search) for developers and agentic workflows. The week's events underscore a bifurcating AI landscape: enterprise and developer trust is shifting toward Anthropic on safety credibility, while model capability gains are increasingly imperceptible to everyday users.

ai-news-roundupllm-releasesopenaianthropicai-policyai-agentsgoogle-gemini

“OpenAI secured a Pentagon contract with the same core red lines Anthropic was blacklisted for insisting on — no domestic surveillance, no fully autonomous weapons — and added a third: no high-stakes automated decisions.”

youtube / mattwolfe / Apr 10

AI Productivity Tools Are Intensifying Work and Degrading Cognitive Capacity, Research Confirms

Multiple independent studies corroborate a counterintuitive finding: AI tools do not reduce workload — they expand it through task creep, blurred work-life boundaries, and increased coordination overhead. The cognitive cost is compounding: workers report "AI brain fry" (mental fog, decision fatigue, slower thinking), and an MIT EEG study directly shows reduced brain activity and degraded independent reasoning in heavy LLM users. The root mechanism is that AI lowers per-task production cost while simultaneously raising coordination, review, and decision-making costs — costs borne entirely by the human. Strategic mitigation requires treating AI as a learning amplifier rather than a cognitive outsourcing mechanism.

ai-productivitycognitive-loadburnoutllm-usagehuman-ai-interactionmental-healthai-research

“AI tools consistently intensify work rather than reduce it, causing employees to work faster, take on broader task scopes, and extend work into non-work hours without explicit managerial pressure.”

youtube / mattwolfe / Apr 10

Claude Builds Custom Visuals From Scratch While ChatGPT Serves Pre-Built Ones — A Key Architectural Difference

Both Anthropic's Claude and OpenAI's ChatGPT launched interactive visualization features within days of each other, but they differ fundamentally in architecture: Claude dynamically generates custom interactive UIs from scratch (slower, ~1–2 min, more flexible but error-prone), while ChatGPT maps prompts to a fixed library of pre-built, cached visualizations (near-instant, but limited to supported concepts). Beyond visualizations, the week saw Perplexity expand its computer-use agent to paid plans via hosted Mac Minis, Canva introduce AI-powered image layer separation, and Andrej Karpathy open-source an autonomous LLM self-optimization loop. Meta's acquisition of Moltbook signals a strategic interest in agent-native social and advertising infrastructure.

ai-toolsllm-updatesai-agentsgenerative-aiproduct-releasesinteractive-aiai-news-roundup

“ChatGPT's interactive visualizations are drawn from a fixed library of pre-built, cached models rather than dynamically generated per prompt.”

youtube / mattwolfe / Apr 10

Nvidia GTC 2025: OpenClaw Security Fix, DLSS5 AI Upscaling, and the Breadth of Nvidia's AI Dominance

Nvidia's GTC conference centered on accelerating AI agent infrastructure and consumer-facing GPU features. The headline developer story was NemoClaw — a one-line install wrapper for OpenClaw that adds a hardened security layer, directly addressing the primary adoption barrier for the popular open-source AI agent framework. On the consumer side, DLSS5 introduces real-time AI upscaling for existing games, though early gamer sentiment is skeptical due to potential visual hallucination artifacts. Overarchingly, Nvidia reinforced its position as the unavoidable substrate of the AI stack, with GPU integrations spanning every major cloud provider and industry vertical.

nvidia-gtcai-agentsopen-source-ailocal-aigaming-aiai-infrastructureai-industry-news

“Nvidia's NemoClaw reduces OpenClaw installation to a single terminal command while adding a security layer to protect API keys, passwords, and sensitive credentials.”

youtube / mattwolfe / Apr 10

AI Image Generation Shakeup: Microsoft's MAI-2 Outshines MidJourney V8 as Google Closes the Vibe Coding Loop

MidJourney V8 disappoints early testers with persistent anatomical errors and weak instruction-following, while Microsoft's surprise entry MAI Image 2 ranks third on text-to-image arenas and demonstrably outperforms on photorealism and in-image text rendering. Google simultaneously launched a tightly integrated design-to-code pipeline — Stitch for AI-native UI design and a new AI Studio vibe coding tool — enabling end-to-end web app generation from a single prompt. Across the broader model landscape, the dominant theme is agentic optimization: smaller, cheaper models (GPT-4.5 Mini/Nano, Cursor Composer 2, Minimax M2.7) are being explicitly positioned for always-on agent workflows, while Nvidia's NemoClaw bundles OpenClaw with enterprise-grade security in a single terminal command.

ai-newsimage-generationllm-releasesai-agentsnvidia-gtcvibe-codinggenerative-ai

“MidJourney V8 still produces significant anatomical errors (e.g., three-fingered hands, broken arms) and struggles with precise instruction-following, underperforming competitors like Nano Banana in structured comparisons.”

youtube / mattwolfe / Apr 10

OpenAI Kills Sora to Reclaim Compute — Bets the Company on Coding, Enterprise, and a New Flagship Model "Spud"

OpenAI is shutting down Sora entirely — including the consumer app, developer API, and ChatGPT video functionality — as part of a strategic pivot to concentrate compute and talent on coding tools, enterprise productivity, and its upcoming flagship model codenamed "Spud." The decision reflects mounting compute constraints and a recognition that AI video generation offers limited ROI compared to core LLM use cases. Competitors like Google's Veo and Chinese models (Cling, Seed Dance) have already surpassed Sora in quality, while Google's ad-revenue model gives it a structural advantage to subsidize exploratory AI products that OpenAI cannot afford. Sam Altman has framed the move as a necessary focus shift, citing a pattern of diffuse side projects — including a browser (Atlas), a search product, and a music generator — that diluted organizational focus ahead of a potential IPO.

openaiai-video-generationproduct-strategycompute-resourcesllm-industrystartup-strategyai-news

“OpenAI is discontinuing Sora in its entirety — the consumer app, the developer API, and video functionality inside ChatGPT — not merely consolidating it into another product.”

youtube / mattwolfe / Apr 10

The Sanders-AOC Data Center Moratorium Would Paradoxically Entrench Big Tech's AI Dominance

Bernie Sanders and AOC introduced the AI Data Center Moratorium Act, which would halt all new U.S. data center construction until federal AI safeguards are enacted. While the bill's core concerns — rising residential electricity costs, water consumption, and environmental harm from data centers — are substantiated by real data, the proposed solution contains a critical unaddressed flaw: constraining compute supply without reducing demand would squeeze out smaller businesses and individuals while leaving well-resourced tech giants insulated. A more defensible policy path would require companies to self-fund energy infrastructure and grid upgrades rather than imposing a blanket construction ban.

ai-policydata-centersai-regulationenergy-infrastructurecompute-supplytech-politicsai-governance

“Residential electricity prices in the U.S. have increased approximately 6% year-over-year from August 2024 to August 2025, with data centers identified as the primary driver of capacity price increases.”

youtube / mattwolfe / Apr 10

AI's Acceleration Week: Anthropic's Compute Takeover, OpenAI's Strategic Retreat, and the Race for Voice and Video

The week ending ~May 2025 marked a sharp divergence in AI platform strategy: Anthropic shipped 74 releases in 52 days—including computer use, auto mode for Claude Code, and project organization—while OpenAI deliberately shed compute-heavy side projects (Sora, adult mode) to double down on chat and coding. Google countered with Gemini 2.5 Flash Live's multimodal conversational features and Lyria 3 Pro's extended music generation, signaling a broader platform consolidation war. Meanwhile, a leaked Anthropic document revealed a forthcoming "Claude Mythos" model tier—described as significantly more capable than Opus and raising internal red flags around cybersecurity risks.

ai-news-roundupgenerative-aillm-toolsanthropic-claudegoogle-geminiopenaitext-to-speech

“Anthropic shipped 74 releases in 52 days, including computer use, Claude Code auto mode, and mobile integrations with Figma, Amplitude, and Canvas.”

youtube / mattwolfe / Apr 10

Anthropic's Leaked Claude Code Reveals Autonomous Background Agent and Three-Layer Memory Architecture

A developer error exposed Claude Code's source code via an npm registry map file, revealing two major unreleased features: a sophisticated three-layer memory architecture using pointer-based indexing instead of full-context retrieval, and "Chyros" — an always-on autonomous background agent that operates on heartbeat intervals to proactively fix code, respond to messages, and send push notifications without user prompting. The leak signals a broader industry shift toward post-prompting AI, where models operate as background infrastructure rather than interactive tools. Simultaneously, OpenAI's $40B fundraise announcement (buried in its blog post) confirmed plans for a unified AI super app consolidating Chat, Codex, and agentic capabilities — mirroring Anthropic's existing integrated Claude ecosystem.

ai-news-roundupllm-releasesagentic-aianthropic-claudeopenaiopen-source-modelsai-tools

“Anthropic's Claude Code contains an unreleased autonomous agent called 'Chyros' that runs 24/7 in the background, executing tasks like fixing code errors, replying to messages, and watching GitHub PRs without user prompts.”

Matt Wolfe

Stress state, subsidence, and faulting in the Wilmington Oil Field, California: a multiphase flow-geomechanics modeling assessment (1936-2020)

No model-independent evidence for a peak in binary black hole spin (mis)alignments

The Hubble Advanced Spectral Product (HASP) Program

AI News: The Biggest Leap We've Seen This Year!

40+ INSANE Ways To Use ChatGPT Image (For FREE)

AI News: 18 Breaking Stories You Missed This Week

This AI Brain Will Replace Notion and Evernote

Coordination Requires a Common Cause in Quantum Theory

A missing causal principle: Coordination

Cheap Expertise: Mapping and Challenging Industry Perspectives in the Expert Data Gig Economy

Engagement Phenotypes for a Sample of 102,684 AI Mental Health Chatbot Users and Dose-Response Associations with Clinical Outcomes

Ecoprompt Enables Children to Reason Systems-Level About AI's Environmental Footprint

DAXS Enables Comprehensive Energy Spectrum Mapping for Double Quantum Dot Hamiltonians

Children's Sensemaking of AI Toys: From Social Curiosity to Adversarial Play Amid Interaction Failures

Quantified Grain Growth Kinetics in High-Entropy (Cr,Mo,Ta,V,W)C1-δ Carbides via Spark Plasma Sintering

Agentic Harmonization Resolves Annotation Conflicts to Boost Layout Detection Across Datasets

The Truth About My Channel (And How Much $$ I Make)

AI News: Huge Updates From Anthropic, OpenAI and Google

The Systemic Flaws and Deception in AI Benchmarking

AI Task Capability Doubles Every 7 Months, Fueling Self-Improving Intelligence Explosion and Imminent Job Automation

ByteDance's Seed Dance 2.0 Redefines AI Video Realism Amid Surging Multimodal and High-Speed LLM Advances

AI Agents Proliferate Amid Google Image Advances and Anthropic-Pentagon Safeguard Standoff

The Emergence of 'Weaponized' Coding Models and the Open-Source Performance Gap Closure

On-Device LLMs Are Now Good Enough for Everyday Use — No Cloud Required

OpenAI's Pentagon Deal Exposes Safety Theater as Anthropic Gets Blacklisted for Holding the Same Red Lines

AI Productivity Tools Are Intensifying Work and Degrading Cognitive Capacity, Research Confirms

Claude Builds Custom Visuals From Scratch While ChatGPT Serves Pre-Built Ones — A Key Architectural Difference

Nvidia GTC 2025: OpenClaw Security Fix, DLSS5 AI Upscaling, and the Breadth of Nvidia's AI Dominance

AI Image Generation Shakeup: Microsoft's MAI-2 Outshines MidJourney V8 as Google Closes the Vibe Coding Loop

OpenAI Kills Sora to Reclaim Compute — Bets the Company on Coding, Enterprise, and a New Flagship Model "Spud"

The Sanders-AOC Data Center Moratorium Would Paradoxically Entrench Big Tech's AI Dominance

AI's Acceleration Week: Anthropic's Compute Takeover, OpenAI's Strategic Retreat, and the Race for Voice and Video

Anthropic's Leaked Claude Code Reveals Autonomous Background Agent and Three-Layer Memory Architecture