absorb.md — A knowledge graph of what AI thinkers are actually saying

Solo is a physical AI inference platform that deploys ensembles of locally fine-tuned Llama models on edge devices — including Raspberry Pis and smartphones — to serve populations without reliable internet connectivity. Rather than relying on a single large model, Solo orchestrates multiple specialized models on-device, targeting agriculture, healthcare, and education for maximum social impact. In healthcare, the platform enables differential diagnosis support and automated patient reporting in rural areas, effectively functioning as a "small hospital in a box." The platform's viability is entirely dependent on Llama's open-source licensing, which allows on-device ownership, experimentation, and offline deployment.

open-source-ai, llama-models, edge-ai, meta-ai, healthcare-ai, offline-inference, ai-for-good

“Solo uses an ensemble of fine-tuned models running locally on-device rather than a single large model, enabling offline physical AI inference.”

youtube / ai-at-meta / Apr 13

Meta FAIR Releases Atomic Modeling Dataset, Scalable RL Sampling Algorithm, and Brain-Language Study

Meta's Fundamental AI Research (FAIR) team has announced three concurrent releases targeting distinct scientific frontiers: atomic-scale molecular modeling, scalable generative model training via scalar rewards, and neuroscientific mapping of language development. The Open Molecules 2025 dataset paired with a Universal Model for Atoms aims to accelerate materials and drug discovery, while the "Adjoint Sampling" algorithm enables generative model training without reference data. A large-scale brain study conducted with Rothschild Foundation Hospital draws structural parallels between language emergence in developing brains and large language models, potentially informing both AI architecture and neuroscience.

meta-ai, ai-research, molecular-modeling, open-source, neuroscience, llm, scientific-discovery

“The Open Molecules 2025 dataset combined with Meta's Universal Model for Atoms enables high-speed, high-accuracy atomic-scale molecular modeling applicable to healthcare and climate research.”

youtube / ai-at-meta / Apr 13 / failed

Adjoint Sampling: A Breakthrough in Highly Scalable, Reward-Driven Generative Modeling | AI at Meta

youtube / ai-at-meta / Apr 13

LLM Representations Converge with Maturing Human Brain Language Processing

Intracranial electrode recordings from epilepsy patients show that children's brains (ages 2-5) already carry language representations that can be decoded from natural speech, and that these representations grow more complex with age. Over the course of training, Llama 3 develops representational geometries that progressively align with both early-childhood and adult brain patterns. This convergence suggests that LLMs capture aspects of the developmental trajectory of human speech comprehension rather than surface mimicry alone.

ai-language-models, brain-language-representations, neuroscience-ai, llm-training, child-language-development, meta-ai-research

“Humans learn language from a few million words, while modern AI requires billions.”

youtube / ai-at-meta / Apr 13

Meta Releases World's Largest DFT Dataset and Universal Atomic Model for Advanced Molecular Simulations

Meta's Open Molecules 2025 provides over 100 million DFT calculations, forming the largest and most diverse dataset of its kind, covering biomolecules, metal complexes, electrolytes, and small molecules. The accompanying Universal Model for Atoms, trained on over 30 billion atoms, sets a new standard for ML-based modeling of atomic interactions in molecules and materials. These tools are intended to support advances in energy storage, disease treatment, and climate mitigation through better molecular property prediction.

molecular-modeling, density-functional-theory, ai-for-science, materials-science, meta-ai, open-dataset, machine-learning-model

“Open Molecules 2025 contains over 100 million DFT calculations”

youtube / ai-at-meta / Apr 13

DINOv3 Scales Self-Supervised Vision to Deliver Dense, Robust Features for Zero-Shot Applications

DINOv3 advances self-supervised learning on images at unprecedented scale, yielding universal vision backbones with rich, dense features that exhibit high self-similarity and consistency across time, objects, and style changes. These features enable zero-shot tasks like segmentation and tracking with minimal annotations, powering top performance across diverse vision applications. The release includes open-source training code, model weights, efficient variants, alternative architectures, and tutorials, plus a specialized backbone for satellite imagery.

dino-v3, self-supervised-learning, vision-backbones, computer-vision, image-features, open-source-ai, satellite-imagery

“DINOv3 produces dense image features with impressive self-similarities and exceptional temporal consistency”
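One way to picture how dense, self-similar features support annotation-light segmentation is to threshold the cosine similarity between a single query patch and every other patch. The sketch below is a toy illustration only: the feature vectors are made up, the function names are hypothetical, and nothing here calls the DINOv3 code or weights.

```python
import math


def cosine_similarity(u, v):
    """Cosine similarity between two equal-length feature vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm = math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v))
    return dot / norm if norm else 0.0


def similarity_map(patch_features, query_index):
    """Similarity of every patch to the chosen query patch."""
    query = patch_features[query_index]
    return [cosine_similarity(query, feat) for feat in patch_features]


def select_patches(patch_features, query_index, threshold=0.8):
    """Indices of patches whose similarity to the query meets the threshold."""
    sims = similarity_map(patch_features, query_index)
    return [i for i, s in enumerate(sims) if s >= threshold]


# Made-up 2-D "features" for three patches: two alike, one different.
toy_features = [[1.0, 0.0], [0.9, 0.1], [0.0, 1.0]]
selected = select_patches(toy_features, query_index=0)
```

With real backbone features the vectors would be high-dimensional and the threshold would need tuning; the point is only that one labeled patch can pull out its look-alikes.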

paper / ai-at-meta / Apr 12

Reinforcement Learning-Driven RAG for Enhanced LLM Reasoning in Multi-hop Q&A

KunLunBaizeRAG is a novel reinforcement learning-driven framework improving Large Language Model (LLM) reasoning in complex multi-hop question-answering. It tackles limitations of traditional RAG like retrieval drift and information redundancy by integrating mechanisms such as RAG-driven Reasoning Alignment (RDRA) and Search-Think Iterative Enhancement (STIE). Experimental validations confirm significant performance gains in exact match and LLM-judged scores across multiple benchmarks, demonstrating its robustness and effectiveness.

llm-inference, reinforcement-learning, rag, multi-hop-qa, knowledge-retrieval, performance-optimization, ai-frameworks

“KunLunBaizeRAG is a reinforcement learning-driven framework.”
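The summary reports gains in exact match, a standard question-answering metric. A minimal sketch of how exact match is commonly computed follows; the normalization steps (lowercasing, stripping punctuation and English articles) are an assumption borrowed from common QA evaluation practice, not something this summary attributes to the paper.

```python
import re
import string


def normalize_answer(text):
    """Lowercase, drop punctuation and English articles, collapse whitespace."""
    text = text.lower()
    text = "".join(ch for ch in text if ch not in string.punctuation)
    text = re.sub(r"\b(a|an|the)\b", " ", text)
    return " ".join(text.split())


def exact_match(prediction, references):
    """True if the normalized prediction equals any normalized reference."""
    pred = normalize_answer(prediction)
    return any(pred == normalize_answer(ref) for ref in references)


def exact_match_score(predictions, references_per_question):
    """Fraction of questions answered with an exact match."""
    hits = sum(
        exact_match(pred, refs)
        for pred, refs in zip(predictions, references_per_question)
    )
    return hits / len(predictions)
```

Exact match is strict by design, which is presumably why the paper pairs it with LLM-judged scores that can credit correct but differently worded answers.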

paper / ai-at-meta / Apr 12

Stochastic Modeling Reveals A. gracilipes Locomotion as Hybrid Brownian-Tumble Movement

This study successfully models the complex locomotion of isolated A. gracilipes ants using a hybrid stochastic approach combining active Brownian motion and run-and-tumble dynamics. The model accurately reproduces observed trajectory statistics by identifying reproducible probability distributions for turn angles, run times, and waiting times. This provides a robust framework for predicting ant movement ecology and gaining insights into underlying generative mechanisms and sensory systems.

stochastic-modeling, biological-locomotion, ant-behavior, quantitative-methods, biological-physics, movement-ecology

“The movement of individual A. gracilipes ants can be accurately described by a stochastic model combining active Brownian and run-and-tumble mechanisms.”
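To make the hybrid idea concrete, here is a minimal simulation sketch that alternates active Brownian runs, pauses, and tumbles. Every distribution and parameter value below is an assumption chosen for illustration (exponential run and waiting times, uniform turn angles, Gaussian heading noise); the study fits its own distributions to data, and this is not the authors' model.

```python
import math
import random


def simulate_hybrid_walk(n_runs, speed=1.0, dt=0.05, rot_diffusion=0.5,
                         mean_run_time=1.0, mean_wait_time=0.3, seed=0):
    """Return a list of (t, x, y) samples for one simulated walker."""
    rng = random.Random(seed)
    t, x, y = 0.0, 0.0, 0.0
    heading = rng.uniform(-math.pi, math.pi)
    path = [(t, x, y)]
    for _ in range(n_runs):
        # Run phase: active Brownian motion (constant speed, diffusing heading).
        run_time = rng.expovariate(1.0 / mean_run_time)
        steps = max(1, round(run_time / dt))
        for _ in range(steps):
            heading += math.sqrt(2.0 * rot_diffusion * dt) * rng.gauss(0.0, 1.0)
            x += speed * math.cos(heading) * dt
            y += speed * math.sin(heading) * dt
            t += dt
            path.append((t, x, y))
        # Waiting phase: the walker pauses in place (assumed exponential).
        t += rng.expovariate(1.0 / mean_wait_time)
        path.append((t, x, y))
        # Tumble: abrupt reorientation by a random turn angle (assumed uniform).
        heading += rng.uniform(-math.pi, math.pi)
    return path
```

Swapping the assumed distributions for empirically fitted ones is the step that would turn this toy into something comparable with measured trajectory statistics.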

youtube / ai-at-meta / Apr 10

Meta's AI Infrastructure Bet: Liquid Cooling, Custom Silicon, and the End of Commodity Data Centers

Meta's VP of Infrastructure Dan Rabinovitsj outlines a fundamental shift in data center design driven by AI workloads: rack thermal density is scaling from ~30 kW to 500–700 kW, forcing a transition from air to full-facility liquid cooling. Meta's in-house AI accelerator program (MTIA) is not primarily cost-driven; it aims to co-design hardware and software for high-value internal workloads like ads ranking and recommendation, where workload-specific optimization yields superior performance-per-TCO. At the semiconductor level, Dennard scaling is effectively dead, shifting the competitive frontier to advanced packaging (chiplets, CoWoS, silicon-on-wafer), which introduces new yield, toolchain, and manufacturing cycle-time challenges at scale.

ai-infrastructure, data-centers, semiconductor-industry, custom-silicon, liquid-cooling, meta-ai, career-advice

“AI data center rack thermal density is scaling from ~30 kW today toward 500–700 kW in next-generation designs, making air cooling architecturally insufficient.”
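A back-of-the-envelope heat balance shows why those densities push past air cooling. The sketch below uses the steady-state relation P = m_dot * c_p * delta_T to estimate the coolant volume needed per second. The fluid properties are approximate textbook values, and the 10 K coolant temperature rise is an assumption for illustration, not a figure from the talk.

```python
def volumetric_flow_m3_per_s(heat_watts, density_kg_m3, cp_j_per_kg_k, delta_t_k):
    """Coolant volume per second needed to carry away heat_watts."""
    mass_flow_kg_per_s = heat_watts / (cp_j_per_kg_k * delta_t_k)
    return mass_flow_kg_per_s / density_kg_m3


# Approximate room-temperature properties (assumed values).
AIR = {"density_kg_m3": 1.2, "cp_j_per_kg_k": 1005.0}
WATER = {"density_kg_m3": 997.0, "cp_j_per_kg_k": 4186.0}
DELTA_T_K = 10.0  # assumed coolant temperature rise across the rack


def compare(rack_kw):
    """Return (air_flow, water_flow) in m^3/s for a rack dissipating rack_kw."""
    heat_watts = rack_kw * 1000.0
    air = volumetric_flow_m3_per_s(heat_watts, delta_t_k=DELTA_T_K, **AIR)
    water = volumetric_flow_m3_per_s(heat_watts, delta_t_k=DELTA_T_K, **WATER)
    return air, water


if __name__ == "__main__":
    for rack_kw in (30, 500, 700):
        air, water = compare(rack_kw)
        print(f"{rack_kw} kW: air {air:.2f} m^3/s, water {water * 1000:.2f} L/s")
```

Under these assumptions air needs roughly 3,500 times the volume of water for the same heat load, so a rack in the hundreds of kilowatts would require tens of cubic meters of air per second.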

youtube / ai-at-meta / Apr 10

Meta's Custom Silicon for Video Transcoding: MSVP Scales Encoding Across Billions of Videos

Meta has developed MSVP (Meta Scalable Video Processor), a custom hardware accelerator purpose-built to handle the full video transcoding pipeline — decode, resize, and multi-format encode — at the scale demanded by Facebook, Instagram, and Messenger. MSVP outperforms traditional software encoders in throughput and quality, and is the first in the industry to embed objective quality metric computation directly in hardware, scoring every encode at scale. As generative AI, AR, and VR content creation accelerates, MSVP is positioned as a foundational infrastructure block for delivering that content to end users.

video-transcoding, meta-hardware, video-encoding, ai-infrastructure, custom-silicon, media-processing, scalability

“MSVP (Meta Scalable Video Processor) is a custom hardware accelerator that handles decode, multi-resolution scaling, and multi-format encoding of video at billions-of-videos scale for Facebook, Instagram, and Messenger.”
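The summary does not say which objective quality metrics MSVP computes in hardware. As an illustration of what scoring an encode means, the sketch below computes PSNR, one widely used objective metric, over flat lists of 8-bit pixel values. The choice of PSNR and the function name are assumptions made for this example.

```python
import math


def psnr_db(reference, encoded, max_value=255):
    """Peak signal-to-noise ratio in dB between two equal-length pixel lists."""
    if not reference or len(reference) != len(encoded):
        raise ValueError("inputs must be non-empty and the same length")
    mse = sum((r - e) ** 2 for r, e in zip(reference, encoded)) / len(reference)
    if mse == 0:
        return math.inf  # identical frames: no distortion
    return 10.0 * math.log10(max_value ** 2 / mse)
```

Computing a score like this for every encode in software costs extra compute per frame, which is the overhead that embedding metric computation in the hardware pipeline would avoid.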

youtube / ai-at-meta / Apr 10

Meta's Research SuperCluster: How Massive GPU Infrastructure Accelerates Frontier AI Training

Meta's Research SuperCluster (RSC) combines latest-generation compute, high-speed interconnects, and fast storage to dramatically compress AI training timelines. The system enables researchers to elastically scale workloads from 8 to 8,000 GPUs, turning multi-month training runs into days. RSC's practical impact is demonstrated by the No Language Left Behind (NLLB-200) project, where a 200-language translation model was trained in ~10 days rather than months. The infrastructure is positioned as a strategic lever for Meta to iterate faster and compete at the frontier of large-scale model development.

ai-infrastructure, ml-training, gpu-clusters, meta-ai, large-scale-compute, multilingual-ai, hpc

“Meta's NLLB-200 model supports translation across 200 languages, covering over 40,000 directional translation pairs.”
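The pair count follows from simple combinatorics: with n languages, each can be translated into the other n - 1, giving n * (n - 1) directional pairs. A small sketch of that arithmetic (the function names are illustrative):

```python
def directed_pairs(n_languages):
    """Number of ordered (source, target) pairs with source != target."""
    return n_languages * (n_languages - 1)


def min_languages_for(pairs):
    """Smallest language count whose directed pair count exceeds `pairs`."""
    n = 2
    while directed_pairs(n) <= pairs:
        n += 1
    return n
```

Exactly 200 languages gives 39,800 pairs, so the "over 40,000" figure in the quote implies a language count slightly above 200; the exact count is not stated in this summary.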

youtube / ai-at-meta / Apr 10

Meta's Vertical AI Infrastructure Stack: Custom Silicon, Exascale Compute, and the End of General-Purpose Hardware

Meta is executing a full-stack AI infrastructure overhaul, from custom silicon to data center architecture, driven by AI workloads growing at 1000x every two years. The company has developed two in-house chips (MTIA for ML inference/recommendation and MSVP for video encoding) to maximize performance-per-watt, trading GPU generality for domain-specific efficiency. Its Research SuperCluster (RSC), with 16,000 GPUs and ~5 exaflops of compute, is one of the largest AI supercomputers operational today. The core thesis: at Meta's scale (serving ~half of humanity), off-the-shelf hardware is structurally insufficient, and vertical integration of silicon, software, and data center design is the only viable path.

ai-infrastructure, custom-silicon, data-centers, meta-ai, large-scale-computing, ml-hardware, ai-accelerators

“Meta's AI workloads are growing at a rate of 1000x every two years.”
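To put the 1000x figure in more familiar terms, the sketch below converts it into an equivalent annual growth factor and a doubling time, assuming smooth exponential growth over the two-year window (an assumption; the source gives only the endpoint ratio).

```python
import math


def annual_growth_factor(total_factor, years):
    """Per-year multiplier that compounds to total_factor over `years`."""
    return total_factor ** (1.0 / years)


def doubling_time_months(total_factor, years):
    """Months needed to double under constant exponential growth."""
    return 12.0 * years * math.log(2.0) / math.log(total_factor)


yearly = annual_growth_factor(1000, 2)
doubling = doubling_time_months(1000, 2)
```

Under that assumption the workload grows about 32x per year and doubles roughly every 2.4 months.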

youtube / ai-at-meta / Apr 10

Meta's SAM 3 Unifies Detection, Segmentation, and Tracking with Multi-Modal Prompting

Meta has released SAM 3 (Segment Anything Model 3), a unified model that extends the original SAM's click-based prompting with text and visual prompting capabilities, enabling detection, segmentation, and tracking across both images and videos. The addition of text prompts allows batch segmentation of object categories simultaneously, reducing manual effort. Visual prompting lets users select an object to surface similar ones in the same image, with iterative follow-up prompts for refinement. SAM 3 is already integrated into production Meta products, specifically powering new effects in Instagram's Edits app.

computer-vision, image-segmentation, object-detection, meta-ai, foundation-models, video-understanding, multimodal-ai

“SAM 3 is a unified model capable of object detection, segmentation, and tracking across both images and videos.”

youtube / ai-at-meta / Apr 10

Meta's SAM 3D Brings Zero-Shot Image-to-3D Reconstruction with Human Body Specialization

Meta has introduced SAM 3D, a pair of models extending the Segment Anything Model into the 3D domain, enabling geometry and texture reconstruction for any object in a single image — including occluded or non-visible surfaces. A specialized variant focuses on human body reconstruction, generating accurate meshes of body shape and pose even for partially hidden individuals or those in uncommon poses. The system targets practical deployment across robotics, scientific research, and consumer platforms like Facebook Marketplace, and is accessible via the Segment Anything Playground.

3d-reconstruction, computer-vision, meta-ai, image-to-3d, segmentation, robotics, generative-ai

“SAM 3D consists of two distinct models designed for image-to-3D transformation tasks.”

youtube / ai-at-meta / Apr 10

Meta's SAM Audio: Multimodal Audio Isolation and Source Separation

SAM Audio is a state-of-the-art model designed for the isolation of specific sounds within complex audio mixes. It leverages text, visual, and span-based prompts to extract distinct elements of speech, music, and general environmental noise.

audio-ai, meta-ai, sound-separation, generative-ai, multimodal-ai, content-creation-tools

“SAM Audio can separate audio sources using natural language text prompts.”

AI at Meta

AI Infra @Scale | AI at Meta

Understanding the Llama 3 Tokenizer | Llama for Developers

Season 7, Episode 10: From Imposter Syndrome to AI Success with Guest Anya Chang, Founder and CEO of

Zuckerberg Admits Meta's Layoffs Are About AI Costs, Not AI Replacing Workers

MSL Eng Director: Promo Hacking, Industry Shifts, Regrets | John Myles White

#358 How AI Agents Will Work While You Sleep | Ruslan Salakhutdinov, Professor at Carnegie Mellon

The AI Job That Didn't Exist a Year Ago — Two Engineers Who Got Hired

SAM 3D: Behind the two-model design | AI at Meta

How AI is helping animal conservation | AI at Meta

Personalized Rehab with AI

SAM 3: Building a unified model architecture for detection and tracking

Unlocking Human Potential: Universal Design and AI at Meta

Building diverse MiniHack environments with just a few lines of code

Inside the Lab: Building for the metaverse with AI (2022)

Ex-Meta AI Chief: My Wife Wanted Me To Retire, Here's Why I Refused! | Yann LeCun X Nitin Dua | Pt 2

Advancing robotics and touch perception | AI Research from Meta FAIR

Meta PARTNR: Unlocking Human-Robot Collaboration

Yann LeCun on the future of deep learning hardware

Around the World in 3000 Hours of Egocentric Video

Introducing Meta Omnilingual Automatic Speech Recognition | Transcription for 1,600+ languages

Solo's Offline-First AI Stack Brings Ensemble Llama Models to Rural Healthcare and Agriculture via Raspberry Pi
