Close Menu
AI Week
  • Breaking
  • Insight
  • Ethics & Society
  • Innovation
  • Education and Training
  • Spotlight
Trending

UN experts warn against market-driven AI development amid global concerns

September 20, 2024

IBM launches free AI training programme with skill credential in just 10 hours

September 20, 2024

GamesBeat Next 2023: Emerging leaders in video game industry to convene in San Francisco

September 20, 2024
Facebook X (Twitter) Instagram
Newsletter
  • Privacy
  • Terms
  • Contact
Facebook X (Twitter) Instagram YouTube
AI Week
Noah AI Newsletter
  • Breaking
  • Insight
  • Ethics & Society
  • Innovation
  • Education and Training
  • Spotlight
AI Week
  • Breaking
  • Insight
  • Ethics & Society
  • Innovation
  • Education and Training
  • Spotlight
Home»Spotlight»Study Finds Prominent AI Models Exhibit Irrational Behaviour in Logic Puzzles
Spotlight

Study Finds Prominent AI Models Exhibit Irrational Behaviour in Logic Puzzles

Ivan MassowBy Ivan MassowJune 5, 20240 ViewsNo Comments2 Mins Read
Share
Facebook Twitter LinkedIn WhatsApp Email

A study by researchers from University College London (UCL) revealed that leading AI models, including ChatGPT and Meta’s Llama, displayed irrational behaviour and made simple mistakes when solving classic logic puzzles designed to test human reasoning. The research raises concerns about the reasoning capabilities of current AI technologies.

Researchers from University College London (UCL) conducted a study on the reasoning capabilities of seven prominent AI models, including ChatGPT, Meta’s Llama, Claude 2, and Google Bard (now called Gemini). The study revealed that these large language models frequently exhibited irrational behavior and simple mistakes while solving logic puzzles designed to test human reasoning.

The AIs were tested using 12 classic logic puzzles such as the Monty Hall Problem, the Linda Problem, the Wason Task, and the AIDS Task. Though humans also struggle with these puzzles, the AI models displayed irrational responses distinct from those typically shown by humans. Notably, some AI models even refused to answer certain logic questions, citing ethical concerns.

Meta’s Llama 2 highlighted these issues by refusing to respond to questions like the Linda Problem due to perceived “harmful gender stereotypes,” affecting its performance. The best-performing AI was ChatGPT 4-0, which correctly answered 69.2% of the time, while the worst was Meta’s Llama 2 7b, with a 77.5% error rate.

These findings, published in Royal Society Open Science, indicate that current AI models do not yet possess human-like reasoning abilities and raise questions about their application in critical fields such as medicine and diplomacy.

Share. Facebook Twitter LinkedIn Telegram WhatsApp Email Copy Link
Ivan Massow
  • X (Twitter)

Ivan Massow Senior Editor at AI WEEK, Ivan, a life long entrepreneur, has worked at Cambridge University's Judge Business School and the Whittle Lab, nurturing talent and transforming innovative technologies into successful ventures.

Related News

UN experts warn against market-driven AI development amid global concerns

September 20, 2024

IBM launches free AI training programme with skill credential in just 10 hours

September 20, 2024

GamesBeat Next 2023: Emerging leaders in video game industry to convene in San Francisco

September 20, 2024

Alibaba Cloud unveils cutting-edge modular datacentre technology at annual Apsara conference

September 20, 2024

Dentistry.One unveils innovative SmileScan AI tool for oral health monitoring

September 20, 2024

Inbolt secures €15 million in Series A round to propel expansion and technological advancements

September 20, 2024
Add A Comment
Leave A Reply Cancel Reply

Top Articles

IBM launches free AI training programme with skill credential in just 10 hours

September 20, 2024

GamesBeat Next 2023: Emerging leaders in video game industry to convene in San Francisco

September 20, 2024

Alibaba Cloud unveils cutting-edge modular datacentre technology at annual Apsara conference

September 20, 2024

Subscribe to Updates

Get the latest AI news and updates directly to your inbox.

Advertisement
Demo
AI Week
Facebook X (Twitter) Instagram YouTube
  • Privacy Policy
  • Terms of use
  • Press Release
  • Advertise
  • Contact
© 2025 AI Week. All Rights Reserved.

Type above and press Enter to search. Press Esc to cancel.