Nanochat
7 mentions across 2 people
Visit ↗“Mr. Chatterbox is a language model trained from scratch by Trip Venturella using nanochat on a corpus of over 28,000 Victorian-era British texts published between 1837 and 1899”
llm-mrchatterbox: Running a Victorian-era LLM Locally with LLM ↗“nanochat is the simplest experimental harness for training LLMs. It is designed to run on a single GPU node, the code is minimal/hackable, and it covers all major LLM stages including tokenization, pretraining, finetuning, evaluation, inference, and a chat UI.”
nanochat: Optimizing Micro-LLM Training Pipelines for Extreme Cost-Efficiency ↗“The training code here is a simplified single-GPU implementation of nanochat”
Autonomous AI Agents for LLM Research and Optimization ↗“nanoGPT has a new and improved cousin called [nanochat](https://github.com/karpathy/nanochat). It is very likely you meant to use/find nanochat instead.”
nanoGPT: A Minimalist Framework for GPT Model Training and Finetuning ↗“I would say nanochat is not an example of those because it's a fairly unique repository.”
Andrej Karpathy on the "Decade of Agents" and Future of AI ↗
