Humans see a negated statement as lying in the opposite direction. LLMs see it as right next door.

Concept::
Fragility. LLMs can be extraordinarily fragile with respect to the precise inputs they are given. Here I ask Google for help fact-checking a claim.

Google’s AI overview erroneously assures me that Ed Sheeran has a song called “The Crow”.

Google query 1/3/2025

Asking the same question but removing the quotation marks around the song title, I get the opposite answer.

In this case, the difference is replicable. We asked repeatedly, and each time Google’s Gemini said yes if we put “the crow” in quotes and no otherwise.

Some people will argue that LLMs are great for information retrieval, and if you aren’t getting good answers it is because you don’t know how to ask the right questions. This strikes us as a powerful counterexample. How would anyone know, in advance, that you have to omit quotation marks around a title?

Google query 1/3/2025

So why does this happen? As we discussed in Lesson 5, it’s difficult or impossible to reverse engineer LLMs to get precise answers.

But we can understand the general problem. For us, the statements “Ed Sheeran has a song called The Crow” and “Ed Sheeran doesn’t have a song called The Crow” feel like polar opposites.

How a human sees the world: “X is true” and “X is false” are polar opposites.

But recall that LLMs encode strings of words in high-dimensional spaces. This is a bit of a simplification, but for the LLM, these statements may be very close to one another in that high-dimensional space. Both involve Ed Sheeran. Both involve a song called The Crow. What fraction of English utterances involve either, let alone both? To the LLM, whether you throw in the word “doesn’t” is almost trivial.

How an LLM sees the world: “X is true” and “X is false” are both about X, so they are very close together.

Predictive text machines like LLMs measure the distance between statements very differently from the way people do. Claims that seem diametrically opposed to us may seem almost identical to them. As a result, they can be wildly inconsistent in their responses.
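
To make the idea of "distance between statements" concrete, here is a minimal sketch that compares the claim, its negation, and an unrelated sentence using cosine similarity over word counts. This is a deliberately crude stand-in: real LLMs use learned embeddings, not bag-of-words vectors, and the helper names (`bag_of_words`, `cosine_similarity`) and the unrelated sentence are assumptions made up for illustration. It shows only the general point that two statements sharing almost all their words can sit close together even when their meanings are opposite.

```python
import math
import re
from collections import Counter


def bag_of_words(text: str) -> Counter:
    """Lowercase the text and count how often each word occurs."""
    return Counter(re.findall(r"[a-z']+", text.lower()))


def cosine_similarity(a: Counter, b: Counter) -> float:
    """Cosine of the angle between two word-count vectors (1.0 = same direction)."""
    dot = sum(a[word] * b[word] for word in a.keys() & b.keys())
    norm_a = math.sqrt(sum(count * count for count in a.values()))
    norm_b = math.sqrt(sum(count * count for count in b.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


# The two statements from the text, plus an unrelated sentence (illustrative only).
claim = "Ed Sheeran has a song called The Crow"
negation = "Ed Sheeran doesn't have a song called The Crow"
unrelated = "The recipe needs two cups of flour"

sim_negation = cosine_similarity(bag_of_words(claim), bag_of_words(negation))
sim_unrelated = cosine_similarity(bag_of_words(claim), bag_of_words(unrelated))

print(f"claim vs. negation:  {sim_negation:.2f}")
print(f"claim vs. unrelated: {sim_unrelated:.2f}")
```

Under this toy measure the claim and its negation score about 0.82, while the claim and the unrelated sentence score about 0.13. The word that flips the meaning barely moves the vector, which is the intuition behind "X is true" and "X is false" landing close together.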

Excerpted from:: LESSON 9