absorb.md — A knowledge graph of what AI thinkers are actually saying

blog / GoogleDeepMind / Feb 18

Google DeepMind Expands AI Initiatives in India for Scientific and Educational Advancement

Google DeepMind is broadening its National Partnerships for AI initiative to India, focusing on integrating advanced AI capabilities into the country's science and education sectors. This strategic collaboration involves providing frontier AI models, fostering research through initiatives like the Google.org Impact Challenge: AI for Science, and transforming educational practices with AI-powered learning tools. The goal is to accelerate scientific discovery, enhance learning outcomes, and address India's national priorities in AI adoption.

ai-policygoogle-deepmindai-for-scienceai-in-educationglobal-partnershipsindia-tech

“Google DeepMind is collaborating with Indian government bodies and institutions to broaden AI access and deployment.”

blog / GoogleDeepMind / Feb 12

Gemini 3 Deep Think: Advancing Frontier Reasoning in STEM and Engineering

Gemini 3 Deep Think is a specialized reasoning mode designed for high-complexity scientific research and engineering. It demonstrates state-of-the-art performance across rigorous benchmarks in mathematics, competitive programming (Codeforces Elo 3455), and AGI (ARC-AGI-2), while showing practical utility in identifying logical flaws in peer-reviewed research and optimizing physical material fabrication.

gemini-deep-thinkai-reasoningllm-applicationsscientific-discoveryengineering-solutionsapi-accessgoogle-deepmind

“Gemini 3 Deep Think achieved an 84.6% score on the ARC-AGI-2 benchmark.”

blog / GoogleDeepMind / Feb 11

Gemini DeepMind Advances Human-AI Collaboration in Scientific Discovery

Gemini Deep Think, leveraging agentic reasoning, has advanced beyond Olympiad-level problem solving to contribute to professional research in mathematics, physics, and computer science. This involves autonomous research, AI-guided collaboration, and semi-autonomous evaluation of open problems, demonstrating its utility in complex, open-ended scientific challenges. The system is described as a "force multiplier" for human intellect, handling knowledge retrieval and rigorous verification, enabling scientists to focus on conceptual depth and creative direction.

ai-researchmathematicscomputer-sciencephysicshuman-ai-collaborationlarge-language-modelsdeepmind-gemini

“Gemini Deep Think achieved Gold-medal standard at the International Mathematics Olympiad (IMO) and similar results at the International Collegiate Programming Contest by summer 2025.”

blog / GoogleDeepMind / Feb 1

Nano Banana 2: Google DeepMind's Latest Image Model Prioritizes Speed and Accessibility

Google DeepMind has launched Nano Banana 2 (Gemini 3.1 Flash Image), an advanced image generation model that integrates the high-speed intelligence of Gemini Flash with the advanced capabilities previously exclusive to Nano Banana Pro. This release aims to democratize access to sophisticated image generation features, such as advanced world knowledge, precise text rendering, and enhanced creative control, while maintaining rapid processing speeds. The model is being rolled out across various Google products, including the Gemini app, Google Search, AI Studio, and Google Cloud, demonstrating a broad integration strategy. Furthermore, Google DeepMind continues to emphasize content provenance through the integration of SynthID and C2PA Content Credentials for AI-generated media.

image-generationgemini-modelai-enhancementscreative-toolsgoogle-productssynthidc2pa-content-credentials

“Nano Banana 2 combines high-speed intelligence with advanced image generation capabilities.”

blog / GoogleDeepMind / Feb 1

Gemini 3.1 Pro: Scaling Core Reasoning for Complex Synthesis and Creative Coding

Gemini 3.1 Pro provides a significant leap in core reasoning capabilities over its predecessor, specifically targeting complex problem-solving and agentic workflows. It demonstrates advanced proficiency in code-based creative generation, such as animated SVGs and interactive 3D interfaces, while showing a marked performance increase in logic-pattern benchmarks.

gemini-3.1-prollm-updatesragreasoningagentic-modelsai-development-tools

“Gemini 3.1 Pro more than doubles the reasoning performance of 3 Pro on the ARC-AGI-2 benchmark.”

blog / GoogleDeepMind / Feb 1

Nano Banana 2: Google’s New Flash Image Model Combines Pro-Level Capabilities with High Speed

Google has released Nano Banana 2 (Gemini 3.1 Flash Image), a new image generation model that integrates the advanced intelligence and creative controls of Nano Banana Pro with the high-speed processing of Gemini Flash. This model significantly expands access to sophisticated image manipulation features, offering advanced world knowledge, precise text rendering, enhanced creative control, and robust provenance tools. Nano Banana 2 aims to provide a versatile solution for diverse workflows, from rapid iterative design to highly accurate, production-ready visual content across various Google platforms.

image-generationmultimodal-aigoogle-productsgenerative-aiai-modelscontent-credentials

“Nano Banana 2 integrates high-speed performance with advanced visual intelligence.”

blog / GoogleDeepMind / Jan 22

D4RT: Unified 4D Scene Reconstruction via Parallelizable Spatial-Temporal Queries

D4RT is a unified encoder-decoder Transformer designed for Dynamic 4D Reconstruction and Tracking, replacing fragmented specialized models with a single query-based framework. By mapping 2D pixels to 3D space and time via parallelizable queries, it achieves significant latency reductions (up to 300x) while performing point tracking, point cloud reconstruction, and camera pose estimation. This efficiency enables potential real-time deployment in robotics and AR spatial computing.

4d-reconstructionai-modelscomputer-visionroboticsaugmented-realityreal-time-aideepmind

“D4RT is 18x to 300x more computationally efficient than previous state-of-the-art 4D reconstruction methods.”

youtube / GoogleDeepMind / Jan 15

Google DeepMind

Google DeepMind, led by Demis Hassabis, is at the forefront of AI development, aiming for Artificial General Intelligence (AGI). The company has successfully integrated its AI research into Google's product ecosystem, demonstrating significant advancements in models like Gemini. Despite the competitive landscape and concerns about an "AI bubble," DeepMind emphasizes responsible AI development and a scientific approach to achieve breakthroughs in various fields.

ai-developmentgoogle-deepmindagi-systemsllm-limitationsworld-modelsai-ethicstech-competition

“Google DeepMind aims to achieve Artificial General Intelligence (AGI) within 5-10 years.”

blog / GoogleDeepMind / Jan 1

Project Genie: Advancing World Models for Interactive Environment Generation

Project Genie is an experimental research prototype powered by Genie 3, Nano Banana Pro, and Gemini, enabling users to create, explore, and remix interactive virtual worlds. This platform advances world model capabilities by providing real-time environment generation, dynamic physics simulation, and diverse interaction possibilities, moving beyond static 3D snapshots. It aims to broaden access to and gather user feedback on world models for AI research and generative media.

deepmind-aiworld-modelsgenerative-aiproject-genieai-prototypesimmersive-experiencesai-research

“Project Genie, powered by Genie 3, allows users to create, explore, and remix interactive virtual worlds.”

blog / GoogleDeepMind / Dec 19

Gemma Scope 2: Open-Source Interpretability Tools for Large Language Models

Gemma Scope 2 is an open-source suite of interpretability tools designed to enhance understanding of large language model (LLM) internal processes. It provides full coverage for the Gemma 3 family of models, from 270M to 27B parameters, enabling researchers to debug emergent behaviors, audit AI agents, and develop safety interventions. The toolkit utilizes sparse autoencoders (SAEs) and transcoders to visualize and analyze model decision-making, addressing critical issues like jailbreaks, hallucinations, and sycophancy.

ai-interpretabilitygemma-scopeai-safetyllm-mechanistic-interpretabilityopen-source-ai

“Gemma Scope 2 is the largest open-source release of interpretability tools by an AI lab to date.”

blog / GoogleDeepMind / Dec 18

Google DeepMind Accelerates US DOE's Genesis Mission with Frontier AI Access for National Labs

Google DeepMind partners with the US Department of Energy to support the Genesis Mission by granting accelerated access to its AI tools, starting with the Gemini-powered AI co-scientist on Google Cloud for all 17 National Laboratories. This multi-agent system synthesizes information to generate hypotheses and has validated drug repurposing for liver fibrosis and predicted antimicrobial resistance mechanisms matching unpublished experiments. Access will expand in early 2026 to AlphaEvolve for algorithm design, AlphaGenome for non-coding DNA analysis, and WeatherNext for forecasting, aiming to boost discoveries in energy, materials, and biomedicine.

ai-in-sciencegoogle-deepmindgenesis-missionnational-labs-partnershipai-co-scientistalphaevolvealphagenome

“Google DeepMind provides immediate accelerated access to AI co-scientist for scientists at all 17 DOE National Laboratories via Google Cloud.”

youtube / GoogleDeepMind / Dec 11 / failed

The arrival of AGI | Shane Legg (co-founder of DeepMind) - Google DeepMind: The Podcast

blog / GoogleDeepMind / Dec 11

Google DeepMind Expands Partnership with UK AI Safety Institute for Foundational AI Safety Research

Google DeepMind has expanded its partnership with the UK AI Safety Institute (AISI) through a new Memorandum of Understanding, moving beyond model testing to foundational safety research. This collaboration aims to accelerate safe AI development by focusing on critical areas such as monitoring AI reasoning processes, understanding socioaffective impacts, and evaluating economic implications of AI. The partnership involves sharing proprietary models and data, joint research, and collaborative problem-solving to mitigate AI risks and ensure beneficial AI progress.

ai-safetyuk-ai-safety-institutegoogle-deepmindai-policyai-ethicsai-monitoringeconomic-impact-of-ai

“Google DeepMind has expanded its partnership with the UK AI Safety Institute (AISI) to foundational safety research.”

blog / GoogleDeepMind / Dec 10

Google DeepMind and UK Government Partner for AI Advancement in Key Sectors

Google DeepMind is strengthening its collaboration with the UK government to leverage AI for national prosperity and security. This partnership focuses on accelerating AI access in science, education, modernizing public services, and enhancing national security. The initiative aims to position the UK as a leader in AI innovation application, setting a precedent for international collaborations.

ai-advancementpublic-private-partnershipuk-governmentscientific-innovationeducation-technologypublic-services-modernizationnational-security-ai

“Google DeepMind will provide priority access to its "AI for Science" models to UK scientists.”

youtube / GoogleDeepMind / Dec 10 / failed

Google DeepMind robotics lab tour with Hannah Fry - Google DeepMind: The Podcast

youtube / GoogleDeepMind / Oct 28

Nano Banano: A New Era of AI-Powered Creative Tools

Nano Banano is a multimodal image generation and editing model developed by Google DeepMind. It combines the visual quality of Imagine models with the conversational and multimodal capabilities of Gemini 2.0 Flash. The model empowers creators by automating tedious tasks, allowing them to focus on creative aspects. It signifies a shift towards more personalized and interactive AI tools in artistic and professional fields, emphasizing user intent and control.

ai-image-generationmultimodal-aicreative-toolshuman-computer-interactionai-ethicslatent-diffusion-modelsgenerative-ai-applications

“AI models like Nano Banano empower creators by automating tedious tasks, enabling them to focus on creativity.”

youtube / GoogleDeepMind / Aug 19 / failed

Genie 3: A New Frontier for World Models [Jack Parker-Holder and Shlomi Fruchter] - 743

youtube / GoogleDeepMind / Aug 16

Genie 3: How DeepMind Built a Real-Time World Model with Persistent Memory from Scratch

Google DeepMind's Genie 3 represents a convergence of three internal research threads — Genie 2 (3D environment generation), VO2 (high-quality video generation), and GameNGen (the "Doom paper") — into a single real-time, text-prompted world model capable of generating interactive, photorealistic environments. The model's most technically significant capability is persistent spatial memory exceeding one minute, achieved without explicit 3D representations (no NeRFs, Gaussian splatting, etc.), instead emerging from scale and a frame-by-frame generation approach. The team views Genie 3 not as an agent but as a general-purpose environment simulator, positioning it as infrastructure for training embodied AI agents — potentially bridging the sim-to-real gap in robotics by combining data-driven realism with the scalability of simulation. Text-to-world prompting (replacing image prompting from prior Genie versions) was enabled by cross-pollination with the VO project team, and emergent behaviors like terrain-appropriate agent locomotion arose from scale rather than explicit engineering.

world-modelsgenerative-aivideo-generationgoogle-deepmindembodied-airoboticsreinforcement-learning

“Genie 3's persistent spatial memory (1+ minute) was achieved without any explicit 3D world representation such as NeRFs or Gaussian splatting, instead emerging from a frame-by-frame generative approach at scale.”

youtube / GoogleDeepMind / Jun 26

AlphaEvolve: Scaling Evolutionary Search for Superhuman Algorithmic Discovery

AlphaEvolve is an autonomous coding agent that utilizes LLMs and evolutionary search to discover novel, high-efficiency algorithms for computer science and mathematics. By iterating through generations of code and filtering them via strict evaluation functions or simulators, it can navigate vast, non-intuitive search spaces to find superhuman optimizations. Its ability to produce interpretable code rather than black-box models allows for human verification and deployment in critical infrastructure, such as Google's data centers.

ai-coding-agentevolutionary-algorithmsllm-applicationsscientific-discoveryalgorithm-optimizationself-improving-aideepmind-research

“AlphaEvolve has discovered algorithms for data center scheduling and training infrastructure that are currently deployed in Google's production environment.”

youtube / GoogleDeepMind / Apr 24 / failed