absorb.md — A knowledge graph of what AI thinkers are actually saying

paper / arvind-ai / Apr 10

Optimizing LLM GPU Utilization via Bound-Latency Online-Offline Colocation

Valve is a production-grade colocation system that optimizes GPU utilization by running offline workloads on idle capacity without compromising latency-critical online LLM inference. It employs a GPU runtime featuring channel-controlled compute isolation and page-fault-free memory reclamation to bound preemption latency and rate. The system demonstrates high scalability and low deployment friction, requiring negligible driver and framework modifications.

llm-inferenceresource-managementgpu-utilizationsystem-optimizationproduction-systemsoperating-systems

“Valve increases cluster utilization by 34.6%, resulting in a saving of 2,170 GPUs.”

youtube / arvind-ai / Apr 10

The Resilience of AI Adoption Amidst Market Volatility

Despite significant investment and concerns about an "AI bubble," the fundamental utility and low inference costs of existing AI models suggest that AI adoption will persist even if a market correction occurs. The impact of a crash would likely be felt more in research and development funding rather than in the continued use and integration of established AI products into daily life and work.

ai-economicsmarket-analysisgenerative-aitech-industryai-investment

“Current AI investment, totaling over a trillion dollars in data centers, has not yet demonstrably boosted GDP growth, leading to speculation of an AI bubble.”

youtube / arvind-ai / Apr 10

Challenging the AGI "Manhattan Project" Narrative

Current narratives surrounding Artificial General Intelligence (AGI) often promote a sense of impending, transformative breakthrough, urging a "Manhattan Project" approach. This perspective, however, oversimplifies the complexities of AI development, misrepresents its potential impact, and carries significant political risks. AGI is unlikely to manifest as a sudden, observable event, and its integration into society will be gradual, necessitating a more nuanced and deliberate approach than an accelerated arms race.

agi-debatesai-policyai-risksai-benchmarkingtechnological-progresssocietal-impact-of-aiai-hype-cycles

“AGI is unlikely to be a sudden, observable event with immediate, earth-shattering consequences.”

youtube / arvind-ai / Apr 10

Moravec’s Paradox: A Misleading Heuristic for AI Progress

Moravec's Paradox, which posits that tasks difficult for humans are easy for AI and vice-versa, is a flawed framework for predicting AI capabilities. Its apparent validity stems from selective focus on specific AI research domains rather than an empirical truth about AI's inherent ease or difficulty with certain tasks. This misconception has led to both alarmism and false comfort regarding AI's societal impact, particularly concerning reasoning and robotics.

ai-predictionsmoravec-paradoxai-capabilitiesai-ethicstechnological-forecastingdeep-learningrobotics

“Moravec's Paradox is not empirically supported and offers no predictive power for AI advancements.”

Arvind on AI

Optimizing LLM GPU Utilization via Bound-Latency Online-Offline Colocation

The Resilience of AI Adoption Amidst Market Volatility

Challenging the AGI "Manhattan Project" Narrative

Moravec’s Paradox: A Misleading Heuristic for AI Progress