absorb.md

Skill Automation Feasibility Index Safi

1 mentions across 0 people

Unknown speaker
paper · 2026-04-09
Recommended

We present the Skill Automation Feasibility Index (SAFI), benchmarking four frontier LLMs -- LLaMA 3.3 70B, Mistral Large, Qwen 2.5 72B, and Gemini 2.5 Flash -- across 263 text-based tasks spanning all 35 skills in the U.S. Department of Labor's O*NET taxonomy (1,052 total model calls, 0% failure rate).

LLM Automation Feasibility of Skills