Garry Tan

Chronological feed of everything captured from Garry Tan.

github_push / garrytan / Apr 18

garrytan pushed to garrytan/gbrain: code update

github_push / garrytan / Apr 18

garrytan pushed to garrytan/gstack: code update

github_push / garrytan / Apr 18

garrytan pushed to garrytan/gbrain: code update

github_push / garrytan / Apr 18

garrytan pushed to garrytan/gbrain: code update

github_push / garrytan / Apr 18

garrytan pushed to garrytan/gstack: code update

github_push / garrytan / Apr 18

garrytan pushed to garrytan/gstack: code update

github_push / garrytan / Apr 18

garrytan pushed to garrytan/gstack: code update

github_push / garrytan / Apr 18

garrytan pushed to garrytan/gstack: code update

github_push / garrytan / Apr 17

garrytan pushed to garrytan/gstack: code update

github_push / garrytan / Apr 17

garrytan pushed to garrytan/gstack: code update

github_push / garrytan / Apr 17

garrytan pushed to garrytan/gstack: code update

github_push / garrytan / Apr 17

garrytan pushed to garrytan/gstack: code update

github_push / garrytan / Apr 17

garrytan pushed to garrytan/gstack: code update

github_push / garrytan / Apr 17

garrytan pushed to garrytan/gstack: code update

github_push / garrytan / Apr 17

garrytan pushed to garrytan/gstack: code update

github_push / garrytan / Apr 17

garrytan pushed to garrytan/gstack: code update

github_push / garrytan / Apr 17

garrytan pushed to garrytan/gstack: code update

github_star / garrytan / Apr 17

garrytan starred davidfstr/rdiscount: Discount (For Ruby) Implementation of John Gruber's Markdown

Discount (For Ruby) Implementation of John Gruber's Markdown. Stars: 749

paper / garrytan / Apr 17

AgentGA: A Novel Genetic Algorithm for Autonomous Code Generation

AgentGA introduces a new framework for evolving autonomous code-generation runs by optimizing the agent seed, which comprises the task prompt and optional parent archives. This system couples a population-level genetic algorithm with long-horizon agents, utilizing a deterministic 1:1 elite tournament for selection and an adaptively controlled operator allocation. The core innovation lies in searching over reusable starting conditions rather than directly modifying code, enabling inherited artifacts to improve subsequent autonomous runs.

autonomous-agents / code-generation / genetic-algorithms / machine-learning / automl / ai-optimization

“AgentGA is a framework that evolves autonomous code-generation runs by optimizing the agent seed, defined as the task prompt plus optional parent archives.”

paper / garrytan / Apr 17

Geo2Sound: Generating Acoustically Realistic Soundscapes from Satellite Imagery

Geo2Sound is a novel framework that addresses the challenge of generating realistic soundscapes from satellite imagery. It uniquely combines structural geospatial attribute modeling, semantic hypothesis expansion, and geo-acoustic alignment. This approach allows for the generation of acoustically plausible and geographically consistent soundscapes, outperforming existing baselines.

soundscape-generation / satellite-imagery / image-to-audio / geospatial-ai / machine-learning / multimedia

“Existing image-to-audio models struggle with complex, wide-area semantic ambiguity in satellite imagery, limiting their application.”

paper / garrytan / Apr 17

Switch: Hierarchical Multi-Skill System for Agile Humanoid Locomotion

The "Switch" system addresses limitations in humanoid robot skill transitions by introducing a hierarchical multi-skill framework. This framework utilizes a Skill Graph (SG) for kinematically similar transitions, a deep reinforcement learning-trained whole-body tracking policy, and an online skill scheduler. The scheduler enables real-time, robust execution and smooth transitions between diverse locomotion skills, enhancing safety and practical applicability of humanoid robots.

humanoid-robots / deep-reinforcement-learning / skill-switching / whole-body-control / robotics-locomotion / skill-graph / motion-imitation

“Existing whole-body control approaches using deep reinforcement learning struggle with flexible transitions between distinct skills in humanoid robots.”

paper / garrytan / Apr 17

UniDoc-RL: Enhancing Visual RAG with Hierarchical Reinforcement Learning

UniDoc-RL is a novel reinforcement learning framework for visual Retrieval-Augmented Generation (RAG) that addresses the limitations of generic retrieval signals in existing systems. By formulating visual information acquisition as a sequential decision-making problem with a hierarchical action space, UniDoc-RL refines visual evidence from coarse-grained document retrieval to fine-grained image selection and active region cropping. The framework utilizes a dense multi-reward scheme and Group Relative Policy Optimization (GRPO) for effective end-to-end training without a separate value network, achieving significant performance gains on benchmarks.

visual-rag / reinforcement-learning / large-vision-language-models / computer-vision / sequential-decision-making / deep-learning

“Existing visual RAG systems struggle with fine-grained visual semantics due to generic retrieval signals.”
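
The group-relative baseline that lets GRPO train without a separate value network can be sketched as follows. This is a minimal illustration of the general GRPO idea only: the function name, the sample rewards, and the `eps` constant are assumptions for this example, and how UniDoc-RL combines its dense multi-reward signals into a single scalar is not described in the summary above.

```python
from statistics import mean, pstdev


def group_relative_advantages(rewards, eps=1e-8):
    """Score each sampled response against the others in its group.

    The group mean acts as the baseline that a learned value network
    would otherwise provide; dividing by the group's standard deviation
    keeps advantages on a comparable scale across prompts.
    """
    baseline = mean(rewards)
    spread = pstdev(rewards)
    # eps (hypothetical default) avoids division by zero when all rewards match.
    return [(r - baseline) / (spread + eps) for r in rewards]


# Hypothetical scalar rewards for four responses sampled for one query.
rewards = [1.0, 0.0, 0.5, 0.5]
advantages = group_relative_advantages(rewards)
```

Responses that score above the group average get a positive advantage and those below get a negative one, so the policy update needs only the sampled group itself.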

github_push / garrytan / Apr 16

garrytan pushed to garrytan/gstack: code update

github_push / garrytan / Apr 16

garrytan pushed to garrytan/gbrain: code update

github_push / garrytan / Apr 16

garrytan pushed to garrytan/gbrain: code update

github_push / garrytan / Apr 16

garrytan pushed to garrytan/gbrain: code update

github_push / garrytan / Apr 16

garrytan pushed to garrytan/gstack: code update

github_push / garrytan / Apr 16

garrytan pushed to garrytan/gstack: code update

github_push / garrytan / Apr 16

garrytan pushed to garrytan/gbrain: code update

github_push / garrytan / Apr 16

garrytan pushed to garrytan/gstack: code update

github_star / garrytan / Apr 16

garrytan starred sparklemotion/nokogiri: Nokogiri (鋸) makes it easy and painless to work with XML and HTML from Ruby.

Nokogiri (鋸) makes it easy and painless to work with XML and HTML from Ruby. Stars: 6247

github_star / garrytan / Apr 16

garrytan starred Mange/roadie: Making HTML emails comfortable for the Ruby rockstars

Making HTML emails comfortable for the Ruby rockstars. Stars: 1345

github_star / garrytan / Apr 16

garrytan starred expressjs/express: Fast, unopinionated, minimalist web framework for node.

Fast, unopinionated, minimalist web framework for node. Stars: 68944

github_push / garrytan / Apr 14

garrytan pushed to garrytan/gstack: code update

github_push / garrytan / Apr 14

garrytan pushed to garrytan/gbrain: code update

github_push / garrytan / Apr 14

garrytan pushed to garrytan/gbrain: code update

github_push / garrytan / Apr 14

garrytan pushed to garrytan/gbrain: code update

github_push / garrytan / Apr 14

garrytan pushed to garrytan/gbrain: code update

github_push / garrytan / Apr 14

garrytan pushed to garrytan/gbrain: code update

github_star / garrytan / Apr 14

garrytan starred Shopify/liquid: Liquid markup language. Safe, customer facing template language for flexible web apps.

Liquid markup language. Safe, customer facing template language for flexible web apps. Stars: 11757

tweet / @garrytan / Apr 13

Garry Tan Persists with Opus 4.6 via API Key in 2025

Garry Tan continues using Opus model version 4.6 accessed through an API key, as shared in an hourly poll on his X feed. This indicates preference for a specific older model iteration over newer alternatives. The setup relies on direct API integration without mention of platform-specific changes.

garry-tan / x-feed / hourly-poll / opus-model / api-key / ai-tools

“Garry Tan is currently using Opus 4.6”

tweet / @garrytan / Apr 13

Garry Tan Clarifies Misleading "Hourly Poll" Framing of His X Feed

A user note labeled Garry Tan's X feed an "hourly poll," prompting an alarmed reaction ("Oh yikes"). Tan immediately calls for clarification to correct the potentially misleading or erroneous description. This indicates proactive error correction in real-time social media monitoring.

garry-tan / x-feed / hourly-poll / user-note / content-clarification

“A user note described Garry Tan's X feed as an 'Hourly poll'”

tweet / @garrytan / Apr 13

User-Driven Taste Customization in Agentic Note-Taking Systems

Garry Tan's SOUL md tool introduces dynamic taste supply during agent interactions for personalized content generation. Users provide taste preferences conversationally with the agent. Results vary by individual (YMMV), indicating subjective personalization.

garry-tan / x-feed / hourly-poll / ai-agent / user-note / soul-md

“SOUL md is updated with a new item tonight”

tweet / @garrytan / Apr 13

Thin Agent Harnesses Maximize Fat Skills and Code for Agentic Engineering

Agentic engineering works best when fuzzy, human-like operations are pushed into expansive markdown-based skills and precise, deterministic tasks into robust code. The orchestration harness stays minimal to avoid bloat. This runs counter to the misconception that a "fat harness" should be the priority; the principle is "thin harness, fat skills and code."

agentic-engineering / software-engineering / ai-development / coding-practices / skills-distillation / garry-tan

“In agentic engineering, push smart fuzzy operations humans do into markdown skills.”

tweet / @garrytan / Apr 13

Garry Tan's X Feed Attracts Strong Hourly Engagement

Garry Tan's X feed prompts an hourly poll that receives an affirmative response. The user note indicates ongoing monitoring of his feed via hourly polls. The explicit "YES" suggests positive reception or validation in the poll results.

garry-tan / twitter-feed / hourly-poll / x-platform / user-note

“An hourly poll is conducted on Garry Tan's X feed.”

tweet / @garrytan / Apr 13

OpenClaw AI Agents Generate "Prompt Reports" for Collaborative Debugging

Garry Tan and collaborators share bug reports from their OpenClaw AI agents to troubleshoot issues in task execution. The practice mirrors GitHub issue tracking but applies to AI prompts, hence the term "prompt reports." One user example describes an agent that needed repeated reminders to adopt a tool such as gbrain after installing it itself.

open-source / ai-tools / prompt-engineering / bug-reports / developer-collaboration / github / ai-claws

“Garry Tan shares bug reports from OpenClaw AI agents with @chrysb and @ericlevine to resolve task snags”

tweet / @garrytan / Apr 13

Garry Tan Builds AI Coding Tool in 9 Days Using His Own Hourly X Feed

Garry Tan has utilized his hourly-poll-generated X feed as a dataset for 3 months to develop gbrain, an open-source AI tool. He constructed the entire gbrain project from scratch in just 9 days. This demonstrates rapid prototyping of specialized LLMs leveraging personal social media archives.

garry-tan / x-feed / gbrain / github-repo / personal-tool / hourly-poll

“Garry Tan has been using his hourly poll X feed for 3 months”

tweet / @garrytan / Apr 13

Garry Tan Adopts Hotel Room Mascot as Personal Good Luck Charm

Garry Tan spotted a small figure in his hotel room and designated it as his good luck charm. This casual endorsement highlights a superstitious ritual in his otherwise tech-focused persona. The item remains in situ, serving as an impromptu talisman during his stay.

garry-tan / twitter-x-feed / hourly-poll / hotel-room / good-luck-charm

“Garry Tan has a 'little guy' present in his hotel room”

tweet / @garrytan / Apr 13

Garry Tan Signals Heavy Financial Investment in AI or X Ecosystem

Garry Tan reports currently allocating substantial funds ("a lot of dollars") into an unspecified high-value endeavor, shared via an hourly poll on his X feed. This indicates active capital deployment, likely into tech startups, AI, or platform enhancements given his Y Combinator leadership. The casual phrasing underscores ongoing, significant financial commitment without detailing recipients or purposes.

garry-tan / twitter-feed / hourly-poll / venture-capital / funding / startup-investment

“Garry Tan is currently investing a large amount of money into something”

tweet / @garrytan / Apr 13

Private Group Chats in Claw Machines Enable Discreet Social Gaming

Moltbook introduces private group chats integrated into claw machine games, allowing users to connect socially without public visibility. Garry Tan endorses this feature as a strong innovation. The concept blends physical arcade gaming with private digital communication for friends.

garry-tan / x-feed / hourly-poll / moltbook / private-group-chats / social-apps

“Moltbook implements private group chats within claw machines for user interactions”
