New research published in Computers in Human Behavior finds that people who use generative AI tools such as ChatGPT perform better on logical reasoning tests than those working without AI, yet they also significantly overestimate their own competence, revealing a clear gap between actual performance and perceived success. In experiments involving logical reasoning problems, AI-assisted participants consistently believed they had scored much higher than they actually did, even when given incentives to estimate accurately, and this overconfidence appeared across skill levels, effectively flattening typical patterns of self-assessment bias such as the Dunning-Kruger effect. The findings suggest that interacting with generative AI can inflate users’ confidence in their answers without improving their ability to judge correctness, pointing to potential challenges for metacognition and decision-making as AI becomes more integrated into work and learning contexts.

