A new test of AI capabilities consists of puzzles that humans are able to solve without too much trouble, but which all leading AI models struggle with. To improve and pass the test, AI companies will need to balance problem-solving abilities with cost.
It is hard because they chose to make it hard by trying to do far too many things at the same time and sell it as a complete product.
Yep that is a problem too, the focus in creating general ai is really slowing down ai research on making it better at specific stuff.
Making it a master of social situations and emotional responses is getting in the way of the ai being good at intelligence and logic for example.
We need more specialized ai research instead of so much fake general intelligence
Yeah, people are frequently terrible at understanding context so it shouldn’t be surprising that a computer has difficulty too.
There are actually a lot of specialized applications of neural network based computing being used for science, but they don’t get the flashy headlines because they are a tool. Those projects use it to find things to focus on narrowing down what people should look into first for confirmation, like ancient settlement patterns, stars that might have planets, and other things where patterns exist but are hard to see.
Some examples are listed here at a high level. In all cases the ai leads to humans confirming and then working from there, it isn’t the end result on its own. https://medium.com/@jeyadev_needhi/uncovering-the-past-how-ai-is-transforming-archaeology-38ded420896d