AI agent performance increased significantly over one year
βAI agents went from about 12% success on real computer tasks a year ago to 66% now. This is agents actually navigating software. They're clicking through forms. They're pulling data. They're finishing multi step jobs in real systems. 66% is not good enough to let something completely loose unsupervised, but it's good enough for giving an agent a really narrow bounded job and checking its work, making sure that it's getting stuff done faster.β

