absorb.md

Google DeepMind

Chronological feed of everything captured from Google DeepMind.

Google DeepMind Expands AI Initiatives in India for Scientific and Educational Advancement

Google DeepMind is broadening its National Partnerships for AI initiative to India, focusing on integrating advanced AI capabilities into the country's science and education sectors. This strategic collaboration involves providing frontier AI models, fostering research through initiatives like the Google.org Impact Challenge: AI for Science, and transforming educational practices with AI-powered learning tools. The goal is to accelerate scientific discovery, enhance learning outcomes, and address India's national priorities in AI adoption.

Gemini 3 Deep Think: Advancing Frontier Reasoning in STEM and Engineering

Gemini 3 Deep Think is a specialized reasoning mode designed for high-complexity scientific research and engineering. It demonstrates state-of-the-art performance across rigorous benchmarks in mathematics, competitive programming (Codeforces Elo 3455), and AGI (ARC-AGI-2), while showing practical utility in identifying logical flaws in peer-reviewed research and optimizing physical material fabrication.

Gemini DeepMind Advances Human-AI Collaboration in Scientific Discovery

Gemini Deep Think, leveraging agentic reasoning, has advanced beyond Olympiad-level problem solving to contribute to professional research in mathematics, physics, and computer science. This involves autonomous research, AI-guided collaboration, and semi-autonomous evaluation of open problems, demonstrating its utility in complex, open-ended scientific challenges. The system is described as a "force multiplier" for human intellect, handling knowledge retrieval and rigorous verification, enabling scientists to focus on conceptual depth and creative direction.

Nano Banana 2: Google DeepMind's Latest Image Model Prioritizes Speed and Accessibility

Google DeepMind has launched Nano Banana 2 (Gemini 3.1 Flash Image), an advanced image generation model that integrates the high-speed intelligence of Gemini Flash with the advanced capabilities previously exclusive to Nano Banana Pro. This release aims to democratize access to sophisticated image generation features, such as advanced world knowledge, precise text rendering, and enhanced creative control, while maintaining rapid processing speeds. The model is being rolled out across various Google products, including the Gemini app, Google Search, AI Studio, and Google Cloud, demonstrating a broad integration strategy. Furthermore, Google DeepMind continues to emphasize content provenance through the integration of SynthID and C2PA Content Credentials for AI-generated media.

Gemini 3.1 Pro: Scaling Core Reasoning for Complex Synthesis and Creative Coding

Gemini 3.1 Pro provides a significant leap in core reasoning capabilities over its predecessor, specifically targeting complex problem-solving and agentic workflows. It demonstrates advanced proficiency in code-based creative generation, such as animated SVGs and interactive 3D interfaces, while showing a marked performance increase in logic-pattern benchmarks.

Nano Banana 2: Google’s New Flash Image Model Combines Pro-Level Capabilities with High Speed

Google has released Nano Banana 2 (Gemini 3.1 Flash Image), a new image generation model that integrates the advanced intelligence and creative controls of Nano Banana Pro with the high-speed processing of Gemini Flash. This model significantly expands access to sophisticated image manipulation features, offering advanced world knowledge, precise text rendering, enhanced creative control, and robust provenance tools. Nano Banana 2 aims to provide a versatile solution for diverse workflows, from rapid iterative design to highly accurate, production-ready visual content across various Google platforms.

D4RT: Unified 4D Scene Reconstruction via Parallelizable Spatial-Temporal Queries

D4RT is a unified encoder-decoder Transformer designed for Dynamic 4D Reconstruction and Tracking, replacing fragmented specialized models with a single query-based framework. By mapping 2D pixels to 3D space and time via parallelizable queries, it achieves significant latency reductions (up to 300x) while performing point tracking, point cloud reconstruction, and camera pose estimation. This efficiency enables potential real-time deployment in robotics and AR spatial computing.

Google DeepMind

Google DeepMind, led by Demis Hassabis, is at the forefront of AI development, aiming for Artificial General Intelligence (AGI). The company has successfully integrated its AI research into Google's product ecosystem, demonstrating significant advancements in models like Gemini. Despite the competitive landscape and concerns about an "AI bubble," DeepMind emphasizes responsible AI development and a scientific approach to achieve breakthroughs in various fields.

Project Genie: Advancing World Models for Interactive Environment Generation

Project Genie is an experimental research prototype powered by Genie 3, Nano Banana Pro, and Gemini, enabling users to create, explore, and remix interactive virtual worlds. This platform advances world model capabilities by providing real-time environment generation, dynamic physics simulation, and diverse interaction possibilities, moving beyond static 3D snapshots. It aims to broaden access to and gather user feedback on world models for AI research and generative media.

Gemma Scope 2: Open-Source Interpretability Tools for Large Language Models

Gemma Scope 2 is an open-source suite of interpretability tools designed to enhance understanding of large language model (LLM) internal processes. It provides full coverage for the Gemma 3 family of models, from 270M to 27B parameters, enabling researchers to debug emergent behaviors, audit AI agents, and develop safety interventions. The toolkit utilizes sparse autoencoders (SAEs) and transcoders to visualize and analyze model decision-making, addressing critical issues like jailbreaks, hallucinations, and sycophancy.

Google DeepMind Accelerates US DOE's Genesis Mission with Frontier AI Access for National Labs

Google DeepMind partners with the US Department of Energy to support the Genesis Mission by granting accelerated access to its AI tools, starting with the Gemini-powered AI co-scientist on Google Cloud for all 17 National Laboratories. This multi-agent system synthesizes information to generate hypotheses and has validated drug repurposing for liver fibrosis and predicted antimicrobial resistance mechanisms matching unpublished experiments. Access will expand in early 2026 to AlphaEvolve for algorithm design, AlphaGenome for non-coding DNA analysis, and WeatherNext for forecasting, aiming to boost discoveries in energy, materials, and biomedicine.

Google DeepMind Expands Partnership with UK AI Safety Institute for Foundational AI Safety Research

Google DeepMind has expanded its partnership with the UK AI Safety Institute (AISI) through a new Memorandum of Understanding, moving beyond model testing to foundational safety research. This collaboration aims to accelerate safe AI development by focusing on critical areas such as monitoring AI reasoning processes, understanding socioaffective impacts, and evaluating economic implications of AI. The partnership involves sharing proprietary models and data, joint research, and collaborative problem-solving to mitigate AI risks and ensure beneficial AI progress.

Google DeepMind and UK Government Partner for AI Advancement in Key Sectors

Google DeepMind is strengthening its collaboration with the UK government to leverage AI for national prosperity and security. This partnership focuses on accelerating AI access in science, education, modernizing public services, and enhancing national security. The initiative aims to position the UK as a leader in AI innovation application, setting a precedent for international collaborations.

Nano Banano: A New Era of AI-Powered Creative Tools

Nano Banano is a multimodal image generation and editing model developed by Google DeepMind. It combines the visual quality of Imagine models with the conversational and multimodal capabilities of Gemini 2.0 Flash. The model empowers creators by automating tedious tasks, allowing them to focus on creative aspects. It signifies a shift towards more personalized and interactive AI tools in artistic and professional fields, emphasizing user intent and control.

Genie 3: How DeepMind Built a Real-Time World Model with Persistent Memory from Scratch

Google DeepMind's Genie 3 represents a convergence of three internal research threads — Genie 2 (3D environment generation), VO2 (high-quality video generation), and GameNGen (the "Doom paper") — into a single real-time, text-prompted world model capable of generating interactive, photorealistic environments. The model's most technically significant capability is persistent spatial memory exceeding one minute, achieved without explicit 3D representations (no NeRFs, Gaussian splatting, etc.), instead emerging from scale and a frame-by-frame generation approach. The team views Genie 3 not as an agent but as a general-purpose environment simulator, positioning it as infrastructure for training embodied AI agents — potentially bridging the sim-to-real gap in robotics by combining data-driven realism with the scalability of simulation. Text-to-world prompting (replacing image prompting from prior Genie versions) was enabled by cross-pollination with the VO project team, and emergent behaviors like terrain-appropriate agent locomotion arose from scale rather than explicit engineering.

AlphaEvolve: Scaling Evolutionary Search for Superhuman Algorithmic Discovery

AlphaEvolve is an autonomous coding agent that utilizes LLMs and evolutionary search to discover novel, high-efficiency algorithms for computer science and mathematics. By iterating through generations of code and filtering them via strict evaluation functions or simulators, it can navigate vast, non-intuitive search spaces to find superhuman optimizations. Its ability to produce interpretable code rather than black-box models allows for human verification and deployment in critical infrastructure, such as Google's data centers.