Chronological feed of everything captured from LangChain.
blog / LangChainAI / Apr 23
LangChain launches Deep Agents deploy, a beta tool that bundles open-source Deep Agents harness with production infrastructure via a single `deepagents deploy` command. It supports any LLM provider, custom instructions via AGENTS.md, Agent Skills, MCP tools, and sandboxes like Daytona or Modal, deploying a scalable LangSmith server with 30+ endpoints for MCP, A2A, Agent Protocol, human-in-the-loop, and memory access. Unlike Claude Managed Agents, it avoids vendor lock-in by using open standards and self-hostable memory in standard formats, ensuring users retain control over agent memory and data flywheels.
langchaindeep-agentsagent-deploymentopen-source-agentsai-harnessesmodel-agnosticagent-memory
“Deep Agents deploy is launched in beta as the fastest way to deploy a production-ready, model-agnostic, open-source agent harness.”
blog / LangChainAI / Apr 23
AI agents require domain-specific tacit knowledge from experts to handle unwritten conventions and technical nuances effectively. Human judgment enhances workflow design, tool selection, and context engineering, integrated through an "agent improvement loop" of rapid builds, production monitoring, and data-driven refinements. Automated LLM-as-a-judge evaluators, calibrated by experts using tools like LangSmith's Align Evaluator, scale this feedback to prioritize high-impact issues and build golden datasets from production traces.
ai-agentshuman-judgmentagent-developmentlangchainlangsmithevaluation-loopcontext-engineering
“Deterministic code in agent workflows reduces latency, token usage, and ensures critical steps execute reliably compared to full LLM sequencing.”
blog / LangChainAI / Apr 23
Agent harnesses have become the standard scaffolding for building LLM-based agents, managing tool interactions and context including memory; they persist as models improve, evidenced by 512k lines in Claude Code. Memory is not a separable plugin but a core harness function handling short- and long-term state, making closed/proprietary harnesses risky due to opaque memory management and API lock-in. Open harnesses like Deep Agents enable model-agnostic deployment with user-owned memory stores, countering model providers' incentives to proprietary-ize state for stickiness.
agent-harnessesagent-memorylangchaindeep-agentsopen-source-agentsai-lock-incontext-management
“Agent harnesses are the dominant and enduring paradigm for building agents”
blog / LangChainAI / Apr 23 / failed
blog / LangChainAI / Apr 23 / failed
blog / LangChainAI / Apr 23 / failed
blog / LangChainAI / Apr 23
LangChain's async subagents address blocking issues in traditional inline subagents by running delegated tasks in the background via Agent Protocol, returning task IDs immediately for supervisors to maintain control. Supervisors gain tools like start_async_task, check_async_task, update_async_task, cancel_async_task, and list_async_tasks to manage multiple parallel subagents, handle user inputs, and course-correct mid-task. This separation allows flexible deployment on LangSmith or self-hosted infrastructure without lock-in, scaling to complex, long-running agent workflows.
async-subagentslangchaindeep-agentsagent-protocolai-agentsmulti-agent-systemsagent-orchestration
“Inline subagents block the supervisor agent until completion due to synchronous tool calls.”
blog / LangChainAI / Apr 23 / failed
blog / LangChainAI / Apr 23 / failed
blog / LangChainAI / Apr 23 / failed
youtube / LangChainAI / Apr 12
A recent AI hackathon, organized by a LangChain ambassador and featuring Lovable, successfully catalyzed innovation within the real estate and construction sectors. The event brought together senior engineers and industry experts to leverage AI for solving long-standing industry problems, demonstrating significant potential for technological disruption in these traditionally legacy fields. The hackathon not only generated promising solutions but also inspired a broader movement towards AI adoption in real estate across Europe.
ai-hackathonreal-estate-techlangchainlow-code-aicommunity-buildingstartup-ecosystem
“AI hackathons can effectively drive innovation and solve problems in legacy industries like real estate and construction.”
youtube / LangChainAI / Apr 12
Generative UI spans a spectrum from controlled (predefined components invoked by agents), declarative (runtime assembly of developer-specified Lego-like blocks), to fully open-ended (on-the-fly code generation), enabling agent-mediated interactions across custom apps, super-apps like ChatGPT, and disposable software. Current demos showcase Copilot Kit's AGUI protocol integrating these paradigms with LangChain agents, Shadcn UI generation via structured JSON schemas, WorkOS widget skills for codebase-aware integrations, and experimental open-gen UI for interactive visualizations. Future shifts emphasize continuous learning from human feedback in production agents, mirroring Cursor's data labeling approach, alongside emerging standards like A2UI from Google.
generative-uiai-agentslangchaincopilot-kitworkosdeclarative-uiagentic-systems
“All UI will become AI-mediated within years, extending to mundane devices like refrigerators”
blog / LangChainAI / Apr 7
LangChain's new Fleet offering distinguishes between two agent authorization models: "on-behalf-of" (Assistants) and fixed credentials (Claws). This differentiation addresses varying security and access control needs for AI agents interacting with external tools. The choice of model impacts data access, sharing capabilities, and the necessity for human-in-the-loop guardrails, particularly for sensitive actions.
langchainlangsmith-fleetllm-agentsagent-authorizationidentity-managementhuman-in-the-loop
“Traditional agent authorization operated 'on-behalf-of' the user, leveraging their specific access privileges.”
blog / LangChainAI / Apr 7
LangChain will have a significant presence at Google Cloud Next 2026, focusing on agent development and deployment. They will showcase updates to the LangChain ecosystem, including LangSmith for observability, evaluation, and deployment, and participate in breakout sessions covering secure, high-velocity runtimes for AI agents and frictionless developer experiences. LangChain will also host networking events and announce LangSmith's availability on the Google Cloud Marketplace.
google-cloud-nextlangchainai-agentslangsmithdeveloper-experiencellm-infrastructureevent-guide
“LangChain will be present at Google Cloud Next 2026 from April 22-24 at Booth #5006.”
blog / LangChainAI / Apr 7
Moda is an AI-native design platform catering to non-designers, enabling the creation of professional-grade visual content through a multi-agent system. This system, built with Deep Agents and observed via LangSmith, addresses the challenge of AI in visual design by employing a custom Domain Specific Language (DSL) for layout representation, contextual engineering, and a collaborative user experience. The platform prioritizes iterative design and efficient resource utilization, demonstrating a strong product-market fit in enterprise sales for pitch deck generation.
ai-designmulti-agent-systemsllm-applicationslangchainlangsmithtool-usecontext-engineering
“Moda utilizes a multi-agent system, built on Deep Agents, for generating production-grade visual designs.”
blog / LangChainAI / Apr 7
LangSmith Fleet now supports shareable skills, enabling agents to be equipped with specialized knowledge for specific tasks. These skills codify domain expertise, improving agent utility by providing essential context that basic reasoning alone lacks. This functionality addresses the challenge of knowledge silos within teams and facilitates consistent agent performance across an organization by making crucial information uniformly accessible.
langchainfleetllmsagent-frameworksknowledge-managementai-agents
“LangSmith Fleet now supports shareable skills for equipping agents with specialized task knowledge.”
blog / LangChainAI / Apr 7
LangChain introduces "Agent Middleware" to enable deep customization of LLM agent harnesses, moving beyond basic prompt and tool adjustments. This system provides distinct hooks (e.g., `before_model`, `wrap_tool_call`) to inject custom logic at various stages of the agent's operational loop. This architecture allows for the implementation of complex features like PII redaction, dynamic tool selection, and robust error handling, which are critical for production-grade AI applications while keeping core agent logic decoupled.
ai-agentsllm-orchestrationlangchainsoftware-architectureagentic-workflows
“Agent harnesses connect LLMs to their environment, data, memory, and tools, forming the core loop of an agent.”
blog / LangChainAI / Apr 7
LangChain emphasizes targeted, behavior-driven evaluations for their Deep Agents framework, aiming to improve accuracy and reliability in production environments. Their methodology prioritizes curating specific evals based on observed agent behavior and desired outcomes, rather than relying on broad benchmarks. This approach focuses on optimizing for both correctness and efficiency, using metrics like step ratio, tool call ratio, and solve rate.
llm-evaluationagent-testinglangchaindeep-agentsllm-metricssoftware-development-lifecycle
“Effective agent evaluations prioritize measuring specific behaviors relevant to production over accumulating a high number of general tests.”
blog / LangChainAI / Apr 7 / failed
blog / LangChainAI / Apr 7
This post outlines a systematic, step-by-step methodology for evaluating AI agents, emphasizing practical considerations from initial setup to production deployment. It differentiates between various evaluation levels, dataset construction strategies, and grader designs. The core insight revolves around building a comprehensive evaluation framework that integrates manual review, automated checks, and continuous feedback loops to ensure agent reliability and performance.
ai-agentsllm-evaluationobservabilitymlopssoftware-testing
“Manual review of agent traces is crucial before building automated evaluation infrastructure.”
youtube / LangChainAI / Apr 7
The conversation explores the paradigm shift from traditional software development to building AI agents, highlighting the increased complexity introduced by non-deterministic LLM behavior. It emphasizes the critical role of "context engineering" and "agent harnesses" in navigating these complexities, particularly for long-horizon agents. The discussion also touches upon the use of tracing for debugging and collaboration, and the growing importance of human feedback and iterative development in refining agent performance.
long-horizon-agentsagent-harnessesllm-agentscontext-engineeringlangchainai-developmentmemory
“Traditional software development relies on code as the sole source of truth, enabling predictable behavior through direct inspection.”
youtube / LangChainAI / Apr 7
The industry is transitioning from 'cloud-native' to 'AI-native' infrastructure, shifting the developer's role from writing deterministic code to orchestrating non-deterministic systems. This new stack relies on the Model Context Protocol (MCP) for tool integration and requires specialized observability and secure sandboxing to mitigate risks like prompt injection and non-deterministic failures. For enterprise viability, this infrastructure must move beyond local developer environments into secure, VPC-deployed runtimes with rigorous audit logging.
gen-aiagent-infrastructurellm-agentsdeveloper-toolsai-nativeobservabilitysecurity
“LLMs have reached a threshold of reliability that enables them to effectively run in loops and call tools, unlocking agentic capabilities.”
youtube / LangChainAI / Apr 7 / failed
youtube / LangChainAI / Apr 7
LangChain, initially known for its chain-based LLM applications, is evolving to become a crucial orchestration layer for AI agents. The company is actively developing frameworks like LangGraph to enable the creation of custom, controllable, and persistent agents that operate within the nuanced spectrum between rigid chains and fully autonomous AI. This strategy addresses the current limitations of both extremes, focusing on practical, production-ready agent deployments.
ai-agentsllm-orchestrationlangchaincognitive-architecturesai-uxdeveloper-tools
“Agents are defined by an LLM controlling the application's flow, deciding what actions to take, and utilizing tools and memory.”
blog / LangChainAI / Apr 7
LangChain has introduced async subagents to Deep Agents, shifting from blocking inline execution to a non-blocking, task-based delegation model. This architecture utilizes the Agent Protocol to support stateful, remote, and heterogeneous agent deployments, allowing supervisor agents to manage multiple concurrent long-running tasks without stalling the user interaction loop.
deep-agentsasync-subagentsagent-protocolllm-agentsmultimodal-ailangchain-updates
“Async subagents enable the supervisor agent to launch parallel tasks and continue user interactions without blocking execution.”
blog / LangChainAI / Apr 7
LangChain Fleet has partnered with Arcade.dev to provide agents with secure and reliable access to over 7,500 optimized tools. This integration addresses the complexities of managing multiple API connections by centralizing tool access through Arcade's MCP gateway. The partnership focuses on delivering agent-specific tools, improving tool selection and reducing issues like hallucinated parameters, while also offering robust authentication and authorization mechanisms for agent actions.
llm-agentstool-integrationlangchaindeveloper-toolsapi-gatewaysagent-security
“LangSmith Fleet now provides access to over 7,500 agent-optimized tools via Arcade.dev's MCP runtime.”
blog / LangChainAI / Apr 5
Continual learning in AI agents extends beyond mere model weight updates, encompassing three distinct layers: the model itself, the operational harness, and external context. Understanding these layers is crucial for developing AI systems that exhibit sustained improvement. This architecture enables more granular and flexible learning strategies, moving beyond the limitations of single-layer continuous learning.
ai-agentscontinual-learningllm-infrastructuremachine-learningagentic-systems
“Continual learning for AI agents occurs at three primary layers: the model, the harness, and the context.”
blog / LangChainAI / Apr 2
Deep Agent harness evaluations reveal that open models like GLM-5 and MiniMax M2.7 now perform comparably to closed frontier models on core agent tasks, including file operations, tool use, and instruction following. This parity is achieved at significantly lower costs and latency, making open models viable alternatives or complements in production deployments. The Deep Agents SDK and CLI simplify integration by abstracting model-specific complexities.
open-source-modelsllm-agentsmodel-evaluationllm-benchmarkingcost-efficiencylatency-optimizationdeep-agents-sdk
“Open models GLM-5 and MiniMax M2.7 achieve performance similar to closed frontier models on core agent tasks.”
blog / LangChainAI / Apr 2
Recent evaluations by LangChain demonstrate that leading open-weight models like GLM-5 and MiniMax M2.7 perform comparably to closed frontier models on core agent tasks including file operations, tool use, and instruction following. This parity is achieved with significantly reduced costs and improved latency. These advancements enable more viable real-world agent deployments in production environments.
open-source-llmsllm-evaluationagentic-aillm-cost-efficiencydeepeagentslangchain
“Open models match closed frontier models in core agent task performance.”
blog / LangChainAI / Apr 1
LangChain is expanding its enterprise AI offerings through a strategic partnership with NVIDIA and significant upgrades to its agent management platform, LangSmith. Key developments include rebranding Agent Builder to LangSmith Fleet, introducing advanced access controls and audit logging, and releasing new open-source tooling. These updates aim to enhance the security, scalability, and functionality of AI agents for enterprise deployment.
langchain-updatesai-agentslangsmithllm-opsdeveloper-toolsproduct-announcementscommunity-events
“LangChain's Agent Builder has been rebranded to LangSmith Fleet and now includes enhanced features for enterprise agent management.”
blog / LangChainAI / Mar 31
LangChain and MongoDB have partnered to integrate MongoDB Atlas as a comprehensive backend for AI agents, addressing the complexities of moving agent prototypes to production. This collaboration provides a unified platform for retrieval, persistent memory, operational data access, and observability, eliminating the need for fragmented infrastructure. The integration aims to leverage MongoDB's existing enterprise presence to streamline the development and deployment of reliable AI agents.
llm-agentsmongodblangchaindata-infrastructurevector-searchobservabilityrag
“The partnership between LangChain and MongoDB enables a complete AI agent backend on MongoDB Atlas.”
youtube / LangChainAI / Mar 12
The conversation with Harrison Chase, co-founder of LangChain, explores the rapid evolution of AI agents, emphasizing the critical role of "harnesses" in enabling LLMs to perform complex tasks. These harnesses, comprising components like system prompts, planning tools, sub-agents, and file systems, are more crucial than the underlying models themselves for achieving reliable and predictable agent behavior. The discussion differentiates between conversational and long-horizon agents, highlighting the increasing importance of coding agents due to their versatility and LLM training data. LangChain's journey reflects this evolution, moving from basic abstractions to sophisticated agent runtimes like Langraph, with a strong focus on observability and continuous improvement for agent engineering.
ai-agentsllm-harnesseslangchainai-infrastructureagent-runtimesobservabilityno-code-ai
“The 'harness' layer is the most important component in the AI agent stack, surpassing the significance of the underlying models.”
blog / LangChainAI / Mar 10
Harness engineering is critical for transforming raw AI models into functional and useful agents. It encompasses all the infrastructure, logic, and tools surrounding a model that enable it to perform complex tasks, maintain state, interact with external environments, and overcome inherent model limitations. This engineering discipline focuses on designing systems that extend and enhance model intelligence rather than solely relying on innate model capabilities.
llm-agentsharness-engineeringmodel-harness-architectureautonomous-ailangchainai-system-design
“An AI agent is a combination of a model and a "harness," where the harness comprises all code, configuration, and execution logic external to the model itself.”
blog / LangChainAI / Mar 9
LangChain implemented a Deep Agent to automate their Go-To-Market (GTM) processes, integrating with existing systems like Salesforce and Gong. This agent significantly improved lead conversion rates and sales rep efficiency by automating lead research, personalized outreach drafting, and account intelligence, maintaining human-in-the-loop oversight and continuous learning from rep interactions.
go-to-market-aisales-automationllm-agentslangchaincrm-integrationhuman-in-the-loopai-tooling
“The GTM agent increased lead-to-qualified-opportunity conversion by 250% from December 2025 to March 2026.”
blog / LangChainAI / Mar 9
LangChain developed a GTM agent using Deep Agents to automate and optimize sales workflows, from lead qualification and personalized outreach to account intelligence. This agent significantly improved conversion rates and sales rep efficiency by integrating with existing systems like Salesforce and Gong, providing a human-in-the-loop mechanism for review and continuous learning from rep interactions. The system also expanded beyond sales, demonstrating utility for other teams like engineering and customer success due to its comprehensive data access.
go-to-market-strategyai-agentssales-automationdeep-agents-frameworklangchain-use-casellm-in-productioncrm-integration
“The GTM agent significantly increased lead-to-qualified-opportunity conversion rates by 250% from December 2025 to March 2026.”
blog / LangChainAI / Mar 5
This article outlines LangChain's methodology for evaluating coding agent skills, emphasizing the need for structured evaluation to ensure performance gains. The process involves setting up clean testing environments, defining constrained tasks with clear metrics, and iteratively refining skills. Key to this approach is leveraging tools like LangSmith for observability and performance comparison.
llm-agentsagent-skillsevaluationlangchainlangsmithcoding-agentstool-use
“Skills are dynamically loaded instructions, scripts, and resources that enhance agent performance in specialized domains.”
blog / LangChainAI / Mar 5
Evaluating skills is crucial for enhancing coding agent performance. Skills are dynamically loaded prompts that improve agent capabilities in specialized domains. A robust evaluation pipeline involves setting up clean testing environments, defining constrained tasks with clear metrics, and strategically organizing skill content. LangSmith provides tools for experiment tracking and analysis.
llm-agentsevaluationlangchainlangsmithcoding-agentsai-testing
“Skills are dynamic, specialized instructions that improve coding agent performance.”
blog / LangChainAI / Mar 4
LangChain has released a new CLI and "Skills" framework for LangSmith, designed to empower AI coding agents. This allows agents to perform complex tasks within the LangSmith ecosystem, such as adding tracing, building test sets, and evaluating performance. This integration dramatically improves agent performance; for instance, Claude Code's pass rate on specific tasks jumped from 17% to 92%. The approach emphasizes agent-driven improvement loops, utilizing a terminal-first methodology.
langchainlangsmithcliai-agentsdeveloper-toolsevaluationllm-engineering
“LangSmith CLI and associated skills enable AI coding agents to proficiently navigate and interact with the LangSmith ecosystem.”
blog / LangChainAI / Mar 4
LangChain\'s new LangSmith CLI and "Skills" paradigm enable AI coding agents to autonomously navigate and optimize within the LangSmith ecosystem. This integration dramatically improves agent performance by providing curated instructions and scripts for tasks like tracing, dataset curation, and evaluation. The approach promotes a virtuous cycle of agent-driven improvement, enhancing development workflows without overwhelming agents with excessive tools.
langsmithai-agentscli-toolsagent-developmentperformance-evaluationllm-toolinglangchain
“The LangSmith CLI allows AI coding agents to perform LangSmith-related tasks directly from the terminal.”
blog / LangChainAI / Mar 1
LangChain's recent updates focus on bolstering enterprise-grade AI agent development and deployment. Key initiatives include a strategic partnership with NVIDIA for a full-stack agent platform, advancements in LangSmith for secure agent fleet management with new features like "Skills" and "Sandboxes," and enhancements to open-source libraries like `langgraph` and `deepagents`. These updates aim to provide robust infrastructure for building, deploying, and managing AI agents in production environments, particularly for enterprise use cases requiring security, control, and scalability.
langchain-updatesai-agentslangsmithllm-opsdeveloper-toolsai-frameworksagent-orchestration
“LangChain has partnered with NVIDIA to deliver a comprehensive enterprise platform for building and operating AI agents.”
youtube / LangChainAI / Jan 16
Context engineering is crucial for optimizing LLM agent performance, cost, and latency. Key techniques involve managing the agent's context window by offloading information to file systems, progressively disclosing tools and skills, and using sub-agents for isolation. Emerging trends include the development of models that can learn to manage their own context and the increasing use of agents for personal life management and bioscience applications.
llm-agentscontext-engineeringai-infrastructuremulti-agent-systemsagent-workflowsllm-memoryretrieval-augmented-generation
“Context engineering is a critical aspect of building effective LLM agents.”
youtube / LangChainAI / Nov 21
The AI agent landscape is rapidly evolving, moving beyond single LLM calls to complex, autonomous "deep agents." These agents leverage improved models, sophisticated harnesses, and file systems for state management. While early agents struggled with reliability, current deep agents, often designed for asynchronous, "first draft" work, demonstrate increased capabilities in complex tasks like research and coding, demanding new approaches to UX and evaluation.
ai-agentsllm-developmentlangchainai-frameworksagentic-workflowshuman-in-the-loopux-design
“AI agents have evolved from simple single LLM calls to more complex, autonomous 'deep agents' that utilize iterative processing and tool use.”
blog / LangChainAI / Oct 23
LangSmith introduces new capabilities to monitor AI agents in production, addressing the limitations of traditional observability. The platform now treats multi-turn interactions as first-party "threads," enabling more comprehensive analysis. Key features include an Insights Agent for automated usage pattern categorization and Multi-turn Evals for assessing complete conversational trajectories, providing vital feedback for agent improvement.
ai-agentsllm-observabilityai-evalslangchainproduction-monitoring
“Traditional observability and pre-production testing methods are insufficient for evaluating agent performance in real-world, multi-turn interactions.”
youtube / LangChainAI / Oct 8 / failed
youtube / LangChainAI / Sep 11 / failed
youtube / LangChainAI / Jul 23
Enterprise agent adoption hinges on a simple expected-value equation: maximize the probability of success × value delivered, while minimizing the cost of failure. The most effective levers are making agent behavior more deterministic (workflows + agents, not workflows vs. agents), reducing perceived risk through observability tools, and designing UX patterns—reversible changes, human approval gates, and "first draft" outputs—that bound downside exposure. The next architectural frontier is "ambient agents" triggered by events rather than humans, enabling one-to-many scale, but these must retain human-in-the-loop checkpoints to remain deployable in enterprise contexts.
ai-agentsenterprise-aihuman-in-the-looplangchainagentic-workflowsambient-agentsllm-infrastructure
“Enterprise agent adoption is governed by an expected-value formula: (P(success) × value if right) − (P(failure) × cost if wrong) must exceed the cost of running the agent.”
blog / LangChainAI / Jun 9
LangGraph's latest release week delivers a set of primitives targeting efficiency and control in agentic workflows: node-level caching reduces redundant computation during development, deferred nodes enable clean map-reduce and multi-agent coordination patterns, and pre/post model hooks give developers lifecycle control over ReAct agent message flow. On the JS side, LangGraph v0.3 ships resumable streams, full type-safety on `.stream()`, and ergonomic graph construction APIs. Together, these features push LangGraph closer to a production-grade orchestration layer for both single and multi-agent systems.
langgraphlangchainagent-frameworksllm-infrastructuredeveloper-toolsworkflow-orchestrationreact-agents
“LangGraph now supports node/task-level caching, allowing individual node results to be cached and reused across workflow runs.”
youtube / LangChainAI / May 30 / failed
youtube / LangChainAI / May 20 / failed
blog / LangChainAI / Apr 21
Trellix implemented LangChain, LangGraph, and LangSmith to develop "Sidekick," an internal application addressing cybersecurity integration and log parsing backlogs. This initiative significantly reduced manual log parsing from days to minutes and accelerated plugin development. The project highlights how LLMOps tools can streamline development workflows, improve customer satisfaction, and provide clear communication of AI processes to diverse stakeholders.
cybersecurityllm-applicationslangchainlanggraphlangsmithdeveloper-toolscase-study
“Trellix significantly reduced the time required for manual log parsing from days to minutes using Sidekick.”