“Hey, ChatGPT, how many R‘s are there in the word ‘strawberry’?” “There are two R‘s in the word ‘strawberry.'” “Are you sure? Because there are three.” “Actually, there are two R‘s in ‘strawberry.’ ...
ChatGPT passes “strawberry” test but fails when switched to “cranberry” AI still struggles with simple letter-counting despite broader improvements Reasoning tests like “car wash” still expose gaps in ...
Confident mistakes – or lies, if you will – are a common problem of large language models used in AI chatbots, with one common shortcoming of ChatGPT being that it would frequently miscount the number ...