fchollet starred explosion/spaCy: 💫 Industrial-strength Natural Language Processing (NLP) in Python
💫 Industrial-strength Natural Language Processing (NLP) in Python. Stars: 33588
Chronological feed of everything captured from François Chollet.
💫 Industrial-strength Natural Language Processing (NLP) in Python. Stars: 33588
Distributed Asynchronous Hyperparameter Optimization in Python. Stars: 7577
Minimal and Clean Reinforcement Learning Examples. Stars: 3632
Claude skills for Synalinks OSS. Stars: 895
Though to be fair it's great engagement farming, the bi-weekly Twitter payouts must be juicy.
Read the release notes: https://github.com/keras-team/kinetic/releases/tag/0.0.2
The Arcade Learning Environment (ALE) -- a platform for AI research.. Stars: 2418
Distribution of high-density lipoprotein 2 and 3 constituents during in vitro phospholipid hydrolysis. — Citations: 22.
[Early diagnosis of Alzheimer's disease]. — Citations: 0.
Citations: 1.
:bar_chart: A D3-based reusable chart library. Stars: 9352
Pretrained model hub for Keras 3.. Stars: 973
scikit-learn: machine learning in Python. Stars: 65789
A Python library for efficient Bayesian modeling with deep learning. Stars: 653
zea: A Toolbox for Cognitive Ultrasound Imaging. Stars: 56
Deep Learning for humans. Stars: 63974
Keras documentation, hosted live at keras.io. Stars: 2983
Multi-backend recommender systems with Keras 3. Stars: 164
Run ML workloads seamlessly on cloud TPUs and GPUs with a single Python decorator. No infrastructure management required.. Stars: 31
Interactive Data Visualization in the browser, from Python. Stars: 20374
The AI inference market at data center scale is attracting new entrants focusing on specialized architectures to address the demanding performance, cost, and power efficiency requirements of large language models. These companies are developing purpose-built silicon, ranging from highly flexible, reconfigurable arrays to ultra-specialized, model-specific chips, each making distinct trade-offs in performance, flexibility, and cost. This market is characterized by a drive towards optimizing for specific AI workloads, often at the expense of generality, to achieve significant gains over general-purpose GPUs.
Drawing on Coase's theory of the firm, this piece argues that AI is the first technology capable of collapsing coordination costs to the level of a single individual — fundamentally redefining the minimum viable size of an organization. Where prior technological waves either scaled hierarchies up (steam, telegraph, railroad) or shrank them via markets (internet, gig economy), AI agents can now plan, execute, and manage entire business portfolios autonomously, making the "one-person conglomerate" structurally viable. The author uses this thesis to introduce HIM (Henry Intelligent Machines PBC), a platform designed to assemble and operate fleets of AI-run microbusinesses on behalf of individual owners. Notably, the author discloses a financial interest in HIM, which warrants scrutiny of the framing.
As software teams scale to hundreds of coding agents, the bottleneck shifts from model capability to organizational readiness — specifically, deterministic quality infrastructure (type checkers, linters, automated QA) and spec-driven development practices. Trust, not technical capability, is the primary barrier to enterprise adoption: UI/UX transparency changes have measurably increased the autonomy granted to agent systems. The product manager role is being "unbundled" into engineering, product marketing, and domain-specialist ops — with technically-inclined PMs best positioned to absorb the change. Autonomous agent deployments are already running in production at systemically important enterprises, with the frontier being how to institutionalize governance and guardrails at scale.
The AI industry is undergoing a structural shift from chip-centric thinking to token-factory economics, where the bottleneck is no longer raw compute but memory bandwidth, interconnect speed, and capital allocation efficiency. Meta faces a strategic misalignment: its consumer-focused product surface (Facebook, Instagram, WhatsApp) doesn't benefit from coding-optimized models, the primary driver of the recursive self-improvement loop powering Anthropic and OpenAI's compounding advantage. Meanwhile, GPU market opacity—where bespoke, multi-broker deals dominate—is driving the emergence of financial infrastructure like compute futures and price indices (now on Bloomberg), signaling the commoditization of AI infrastructure. Open-weight models like Gemma 4 (31B) and Qwen are rapidly closing the performance gap with frontier hosted models, accelerating a hybrid architecture where edge handles consumer workloads and frontier models serve high-complexity enterprise tasks.
François Chollet (Keras creator) is pursuing a fundamentally different ML paradigm at his new lab Indra: replacing parametric deep learning models with the smallest possible symbolic models, optimized via "symbolic descent" — an analog of gradient descent in symbolic space. The core theoretical motivation is the minimum description length principle: the shortest model that explains data is most likely to generalize, and parametric learning is structurally incapable of finding it. Chollet distinguishes between "AGI as automation" (the industry's current trajectory) and true general intelligence (human-level sample efficiency across arbitrary tasks), arguing the LLM stack may achieve the former but not the latter without a foundational rethink.
François Chollet argues that the AI industry's scaling paradigm — more data, compute, and parameters — is fundamentally misaligned with true AGI, which he defines as efficiency of skill acquisition rather than task performance. His ARC benchmark exposed that recent model improvements stem from brute-force problem-space mining (self-generated training loops), not genuine generalization. Chollet's alternative is program synthesis: searching for the shortest symbolic rule that explains data, mirroring the scientific method. His most provocative claim is that true AGI may ultimately be a compact program under 10,000 lines of code — achievable in principle with 1980s hardware, given the right idea.
Francois Chollet, founder of the ARC prize, advocates for a paradigm shift in AI research, moving beyond the current deep learning and LLM-centric approaches. He proposes "symbolic learning" or "program synthesis" as a more optimal path to Artificial General Intelligence (AGI), emphasizing efficiency, generalization, and human-level data efficiency. Chollet argues that while current LLM advancements are impressive for domains with verifiable rewards, true AGI requires a more fundamental, self-improving algorithmic approach that minimizes human intervention and aims for foundational optimality rather than architectural scaling.
The ARC Prize Foundation is actively recruiting a senior platform engineer to lead the development of their ARC-AGI benchmark platform. This role is critical for advancing the definition and measurement of progress toward Artificial General Intelligence (AGI) by expanding existing benchmarks and establishing new ones. The position requires a strong background in backend engineering, distributed systems, cloud infrastructure, and experience in building evaluation platforms, preferably within AI/ML.
Deep Learning (DL) researchers often lack exposure to and understanding of alternative machine learning paradigms beyond gradient descent-based parameter fitting. This narrow focus can limit innovation and the exploration of more effective or efficient learning methods. The observation suggests a potential knowledge gap within the DL community regarding the broader field of machine learning.
Symbolic learning offers a method to losslessly reverse-engineer the source code of generative programs, contrasting with curve-fitting's lossy approximation of outputs. This approach is significantly more effective when the underlying generative program is simple, potentially outperforming other methods by orders of magnitude in such scenarios.
The emergence of reasoning capabilities in recent Language Reasoning Models (LRMs) was unanticipated by observers who previously asserted that 2023-2024 base Large Language Models (LLMs) already possessed full reasoning. This oversight stemmed from a lack of understanding regarding the distinct characteristics to look for in advanced reasoning. Current LRMs are hypothesized to outperform earlier LLMs on complex math problems, indicating a significant advancement in fluid intelligence.
Base Large Language Models (LLMs) from 2023-2024 demonstrably lack fluid intelligence and mathematical reasoning capabilities, a fact now widely accepted despite initial controversy. This limitation contrasts sharply with emerging Language Reasoning Models (LRMs), which are hypothesized to perform significantly better on complex reasoning tasks. The inability of proponents of early LLMs to recognize this deficiency highlights a potential blind spot in evaluating AI capabilities when expectations are misaligned with empirical evidence.
The discourse between Sam Altman and François Chollet reveals a fundamental divergence in AGI methodology: OpenAI continues to scale existing paradigms toward aligned AI researchers, while Chollet advocates for a foundation shift toward symbolic learning to achieve optimal generalization. While benchmarks like Arc-AGI 3 provide rigorous tests for fluid intelligence, OpenAI is increasingly prioritizing 'real-world' value—such as scientific discovery—over general-purpose generative benchmarks. This shift is accompanied by a strategic reallocation of compute toward high-impact domains like medicine and economics.
François Chollet advocates for the use of Keras with JAX, implying this combination is crucial for success in AI development. The statement suggests that alternative approaches may lead to suboptimal outcomes, highlighting Keras/JAX as a preferred, high-performance pathway.
Visualizing two independent, autocorrelated random time series as a scatter plot can misleadingly suggest structure or correlation. This occurs because highly autocorrelated series, even if random and independent, produce a trajectory that appears structured in a scatter plot. This method is an inadequate way to assess relationships in such data, as it can be easily misinterpreted as genuine correlation when none exists, highlighting the need for more robust statistical analysis methods.
The ability to "fit a curve" is often associated with understanding and prediction in scientific and engineering domains. However, this analogy breaks down when applied to highly complex systems, particularly those that exhibit emergent properties or non-linear behaviors that cannot be adequately captured by traditional curve-fitting methods. This suggests a fundamental limitation in applying reductionist approaches to phenomena beyond a certain threshold of complexity.
Scientific advancements, exemplified by the development of the atom bomb from the discovery of radioactivity, demonstrate extreme generalization achieved through symbolic compression. A limited number of deliberately collected data points (key experiments) are translated into concise symbolic models, enabling the reverse-engineering of causal rules to reshape reality. This process highlights an efficient pathway for scientific progress, distinct from merely fitting curves to existing data.