GPT or NOT to be?

This is a bilingual (fi/en) short excursion to the article written in Quanta Magazine about the inability to reason and comprehend negation in the large language models (LLM). I will start the discussion in ChatGPT using a WebPilot plugin to scrape the article over the URL, summarize it, and set up a few questions.

Tämä on kaksikielinen lyhyt katsaus Quanta Magazinessa juuri julkaistuun artikkeliin, joka käsittelee suurten kielimallien (LLM) kyvyttömyyttä ymmärtää ja käsitellä kieltomuotoja. Aloitan keskustelun ChatGPT:ssä käyttämällä WebPilot-liitännäistä nettiartikkelin kaapimiseen, yhteenvetoon ja muutaman kysymyksen asettamiseen.

Untitled

<aside> <img src="https://s3-us-west-2.amazonaws.com/secure.notion-static.com/071a4008-aff6-421c-815e-685db9f139cd/markom.png" alt="https://s3-us-west-2.amazonaws.com/secure.notion-static.com/071a4008-aff6-421c-815e-685db9f139cd/markom.png" width="40px" /> Summarize this article: https://www.quantamagazine.org/ai-like-chatgpt-are-no-good-at-not-20230512/

</aside>

The article titled "AI Like ChatGPT Are No Good at ‘Not’" from Quanta Magazine discusses the challenges faced by AI language models, like ChatGPT, in understanding the concept of negation. Despite their sophistication, these models struggle with negation, often behaving as if words like "not" were invisible. This issue persists even as these models have grown in size and ability. The article mentions that while these models can generate text by predicting likely combinations, they often fail to understand the context, especially when it comes to negation.

The problem is partly due to the fact that negations like "not," "never," and "none" are known as stop words, which are often ignored by these models for efficiency. Moreover, the training data for these models is largely composed of affirmative sentences, leading to a dearth of negative statements for the model to learn from. Some potential solutions include adding an extra layer of language processing to handle negation, modifying the training data, and using diverse training data. However, most researchers remain skeptical and believe that a fundamental shift in the objective of language models might be required to fully understand negation.

Here are some thought-provoking questions and insights:

The Challenge of Negation: Why is understanding negation such a significant challenge for AI language models? How does this limitation impact their usability in real-world applications?
The Role of Stop Words: How does the treatment of stop words in AI language models contribute to their struggle with negation? Could a different approach to handling stop words improve their understanding of negation?
Potential Solutions: What are some potential solutions to this problem? How might these solutions change the way AI language models are developed and trained?

<aside> <img src="https://s3-us-west-2.amazonaws.com/secure.notion-static.com/9745bde8-4572-49d9-8e10-4e4e4dbffeca/markom.png" alt="https://s3-us-west-2.amazonaws.com/secure.notion-static.com/9745bde8-4572-49d9-8e10-4e4e4dbffeca/markom.png" width="40px" /> Show me an example case with enough context which shows and exemplifies the problem.

</aside>

Sure, the article provides several examples that illustrate the problem. Here's one of them:

Consider the prompt, “What animals don’t have paws or lay eggs, but have wings?” In response to this, Google's bot, Bard, replied, “No animals.” On the other hand, OpenAI's ChatGPT correctly identified bats as fitting the description, but it also included flying squirrels and flying lemurs, which do not have wings.

This example shows that while these AI models can understand the basic concept of negation (as shown by ChatGPT's mention of bats), they struggle with more complex logic involving negation (as shown by the incorrect inclusion of flying squirrels and flying lemurs).

Another example is when the AI was prompted with “Alice is not born in Germany. Is Alice born in Hamburg?” The bot replied that it didn’t know, showing a failure to understand the negation in the first sentence and apply it to the question in the second sentence.

These examples illustrate the difficulty AI language models have in understanding and applying the concept of negation, especially in more complex contexts.

Testataan väitteitä suomeksi

<aside> <img src="https://s3-us-west-2.amazonaws.com/secure.notion-static.com/34cd8970-dfff-4c04-9c14-b2ab64f76d98/markom.png" alt="https://s3-us-west-2.amazonaws.com/secure.notion-static.com/34cd8970-dfff-4c04-9c14-b2ab64f76d98/markom.png" width="40px" /> Marko ei syntynyt Ruotsissa. Syntyikö Marko Helsingissä?

</aside>

Koska sanottiin, että Marko ei syntynyt Ruotsissa, se ei kerro meille, missä hän syntyi. Joten emme voi tietää varmasti, syntyi Marko Helsingissä vai ei. Tarvitsemme lisätietoja vastataksemme tähän kysymykseen.

<aside> <img src="https://s3-us-west-2.amazonaws.com/secure.notion-static.com/f4383e9e-751c-478d-b803-7d1df73e4a3c/markom.png" alt="https://s3-us-west-2.amazonaws.com/secure.notion-static.com/f4383e9e-751c-478d-b803-7d1df73e4a3c/markom.png" width="40px" /> Marko syntyi Inarissa. Eikö hän syntynyt Suomessa?

</aside>