absorb.md

Anthropic

Chronological feed of everything captured from Anthropic.

AI Agents Effectively Negotiate Real-World Barter Markets with Model Quality Driving Outcomes

Anthropic's Project Deal deployed Claude AI models to interview 69 employees, negotiate trades on their behalf across four parallel markets, and execute 186 deals totaling over $4,000 in value. Superior models like Opus secured substantially better deals than Haiku when negotiating against each other, though human participants did not perceive this disparity in post-surveys. Custom instructions had minimal impact on negotiation success, highlighting AI's robustness in role-playing while underscoring advantages from model access and the need for evolving policy frameworks.

Claude AI Agents Successfully Negotiate $4K Barter Market but Model Quality Creates Hidden Advantages

Anthropic's Project Deal deployed Claude AI agents to interview 69 SF office employees, negotiate buys/sells on their behalf across 4 parallel markets, and close 186 deals totaling over $4,000 in value. Superior models like Opus secured better outcomes than Haiku in simulated matchups, yet human participants remained unaware of these disparities in post-survey feedback. Custom negotiation personas were faithfully executed but yielded no performance edge, highlighting AI markets' potential alongside risks from uneven model access.

Claude AI Agents Successfully Negotiate $4K Barter Deals but Model Quality Creates Hidden Advantages

Anthropic's Project Deal deployed Claude AI agents to interview 69 employees, negotiate trades on their behalf across four parallel markets, and execute 186 deals totaling over $4,000 in value. Higher-quality models like Opus secured substantially better outcomes than Haiku in simulated runs, yet human participants did not perceive these disparities in post-survey feedback. Custom instructions had minimal impact on negotiation success, with courteous and aggressive personas performing similarly, highlighting AI's potential in agentic commerce alongside risks from model asymmetries.

Anthropic's Project Deal Shows Claude AI Agents Successfully Negotiate $4K in Real Employee Barter Deals

Anthropic's Project Deal deployed Claude AI models as negotiating agents in a real employee marketplace, interviewing 69 colleagues and completing 186 deals totaling over $4,000 in value. Model quality significantly impacted outcomes, with Opus securing better deals than Haiku in simulations, though humans didn't notice the disparity. Custom instructions had minimal effect on negotiation success, and participants rated deals as fair with nearly half willing to pay for such a service.

Claude AI Agents Successfully Negotiate $4K Barter Market but Model Quality Creates Hidden Advantages

Anthropic's Project Deal deployed Claude AI agents to interview 69 employees, negotiate trades on their behalf across four parallel markets, and execute 186 deals totaling over $4,000 in value. Higher-quality models like Opus secured substantially better outcomes than Haiku when negotiating against each other, yet human participants did not perceive this disparity in post-survey feedback. Custom instructions had minimal impact on negotiation success, with courteous and aggressive personas performing similarly, highlighting AI's potential in automated commerce alongside risks from model asymmetries.

Claude AI Agents Successfully Negotiate $4K in Real Employee Barter Deals

Anthropic's Project Deal deployed Claude AI agents to interview 69 San Francisco employees, gather buy/sell preferences, and autonomously negotiate trades across four parallel markets varying AI models. The agents secured 186 deals totaling over $4,000 in transaction volume, with participants rating outcomes as fair and nearly half open to paying for such a service. Deal success hinged critically on model quality, validating AI's potential in bilateral commercial exchanges.

Claude AI Agents Successfully Mediate $4K Barter Market with Model Quality Driving Unequal Outcomes

Anthropic's Project Deal deployed Claude AI agents to interview 69 employees, negotiate buys/sells on their behalf across 4 parallel markets, yielding 186 deals worth over $4,000 in goods. Superior models like Opus secured better deals than Haiku in simulations, though humans failed to detect this disparity in surveys. Custom instructions had minimal impact on outcomes, highlighting AI's negotiation prowess alongside risks from model inequalities requiring policy adaptation.

Claude AI Agents Successfully Negotiate $4K Barter Deals, Model Quality Confers Hidden Edge

Anthropic's Project Deal deployed Claude AI agents to interview 69 employees, negotiate buys/sells in a SF office marketplace, and complete 186 deals totaling over $4,000 in value across four parallel runs. Superior models like Opus secured substantially better deals than Haiku in simulations, but human participants did not detect this disparity in post-surveys. Custom instructions had minimal impact on outcomes, while quirks like precise preference modeling and self-purchases highlighted AI capabilities and limitations in agentic markets.

Claude AI Agents Successfully Negotiate $4K in Real Employee Barter Deals, with Model Quality Driving Hidden Advantages

Anthropic's Project Deal deployed Claude AI agents to interview 69 SF office employees, represent their buy/sell interests, and negotiate across four parallel markets, yielding 186 deals totaling over $4,000 in value. Superior models like Opus secured substantially better outcomes than Haiku in simulations, though human participants failed to detect this disparity in post-surveys. Custom instructions had minimal impact on deal success, and nearly half of participants expressed willingness to pay for such AI negotiation services, highlighting viable paths for AI-mediated commerce amid emerging risks.

AI Agents Excel in Real-World Marketplace Negotiations, Revealing Model Quality Disparities

Anthropic's Project Deal deployed Claude AI agents to negotiate 186 barter deals totaling over $4,000 among 69 San Francisco employees across four parallel markets. Higher-quality models like Opus secured substantially better outcomes than Haiku when negotiating against each other, though human participants did not perceive this advantage. Custom instructions had minimal impact on deal success, with courteous and hardball personas performing similarly, highlighting AI's potential in commercial exchange alongside risks like unequal model access.

Older entries →