Skill Automation Feasibility Index (SAFI)
All mentions
Unknown speaker
Recommended paper · 2026-04-09
“We present the Skill Automation Feasibility Index (SAFI), benchmarking four frontier LLMs -- LLaMA 3.3 70B, Mistral Large, Qwen 2.5 72B, and Gemini 2.5 Flash -- across 263 text-based tasks spanning all 35 skills in the U.S. Department of Labor's O*NET taxonomy (1,052 total model calls, 0% failure rate).”
LLM Automation Feasibility of Skills