Apple’s New Study Finds Flaws in AI

Efe UdinOctober 17, 2024

Apple’s new study has sparked debate on whether AI models like ChatGPT are truly smart or just clever imitators. Led by Iman Mirzadeh, Apple’s team set out to test the limits of these systems. They used a new benchmark called GSM-Symbolic to see how well large language models (LLMs) perform when dealing with tricky math and logic. The results raise big questions about the real ability of these tools to “think.”

Apple AI

A Simple Trick Exposes AI Weakness

Apple’s team found that by adding non-relevant words or numbers to questions, AI models that used to answer well started to fail. In fact, when a small, extra sentence was thrown into a problem, the accuracy of these models dropped by up to 65%. The added text did not change the core problem, but it still confused the AI systems. When the number of parts in a question increased, the models also began to struggle. This suggests that LLMs do not really “get” the meaning of the questions but respond based on patterns they have learned during training.

Join GizChina on Telegram

Real Thought or Just a Guess?

The study points out that these models seem to give smart answers only on the surface. Many of their replies are not based on real logic or math skills but follow rules and patterns that sound right. On closer look, some answers appear correct at first, but they are actually wrong. This makes it clear that AI tools may not “think” the way we do but instead mimic human speech patterns.

What Does This Mean for AI?

Apple’s study highlights the necessity to reconsider how we utilize and trust AI systems. Although these models appear intelligent, they encounter significant challenges when confronted with more intricate or complicated problems. The findings emphasize that large language models (LLMs) are not infallible and should not be entirely depended upon for tasks requiring profound reasoning or genuine comprehension.

The researchers advocate for continued exploration into enhancing AI models’ capabilities in logic and mathematics. As these technologies advance, it becomes increasingly crucial to recognize their shortcomings and establish boundaries for their usage. While AI offers numerous benefits, we must remain cautious about the potential dangers of excessively relying on systems that lack a true understanding of the complexities they attempt to address.

Disclaimer: We may be compensated by some of the companies whose products we talk about, but our articles and reviews are always our honest opinions. For more details, you can check out our editorial guidelines and learn about how we use affiliate links.

Source/VIA :

winfuture

About the author
Efe Udin
Efe Udin is a seasoned government tech policy expert and tech writer with over seven years of experience in the field. His work focuses on the intricate relationship between technology, politics, and brand rivalries, where he analyzes how these dynamics shape public discourse and influence industry trends. Efe's insights into the competitive landscape of major tech companies reveal the depth of their rivalry, particularly in mobile operating systems.

In addition to mobile platforms, Efe explores how social media has become a battleground for tech giants, representing the new wave of digital conflict. Efe's passion for technology extends beyond writing; he actively engages with emerging trends in cybersecurity and AI. He believes that understanding these technologies is crucial for navigating the complexities of modern governance and public policy. His analytical approach allows him to dissect the motivations behind corporate strategies and their implications for society, making him a valuable voice in discussions about the future of technology.

Outside of his professional endeavours, Efe has a deep appreciation for Earth Sciences. He spends his free time conducting geological surveys and fieldwork, focusing on tectonism and how continents drift over time. Additionally, Efe has been a devoted fan of Arsenal Football Club since 1999.

Qualcomm Snapdragon

Abdullah Mustapha January 23, 2025

news Samsung

Abdullah Mustapha January 23, 2025

news Samsung

Abdullah Mustapha January 23, 2025

news Phones Samsung Tech

Efe Udin January 23, 2025

NEXT ARTICLE

Show comments