• masterplan79th@lemmy.world
    English
    arrow-up
    8
    arrow-down
    18
    ·
    1 month ago

    When you ask an LLM a reasoning question, you’re not expecting it to think for you. You’re expecting that it has crawled multiple people asking semantically the same question and getting semantically the same answer from other people, and that those exchanges are now encoded in its vectors.

    That’s why you can ask it: because it encodes semantics.

    • ebu@awful.systems
      English
      arrow-up
      24
      ·
      1 month ago

      because it encodes semantics.

      if it really did so, performance wouldn’t swing up or down when you change syntactic or symbolic elements of problems. the only information encoded is language-statistical

    • self@awful.systems
      English
      arrow-up
      23
      ·
      1 month ago

      thank you for bravely rushing in and providing yet another counterexample to the “but nobody’s actually stupid enough to think they’re anything more than statistical language generators” talking point

    • sc_griffith@awful.systems
      English
      arrow-up
      14
      ·
      edit-2
      1 month ago

      guy who totally gets what these words mean: “an llm simply encodes the semantics into the vectors”

      • self@awful.systems
        English
        arrow-up
        15
        ·
        1 month ago

        all you gotta do is, you know, ground the symbols, and as long as you’re writing enough Lisp that should be sufficient for GAI

    • V0ldek@awful.systems
      English
      arrow-up
      14
      ·
      1 month ago

      because it encodes semantics.

      Please enlighten me: how? I admit I don’t know all the internals of the transformer model, but from what I know it encodes precisely only syntactical information, i.e. which syntactical token is most likely to come next, given a syntactical context window.

      How does it encode semantics? What is the semantics that it encodes? I doubt they have a denotational or operational semantics of natural language; I don’t think something like that even exists, so it has to be some smaller model. Actually, it would be enlightening if you could tell me at least what the semantic domain here is, because I don’t think there’s any naturally obvious choice for that.