• @Phroon@beehaw.org
    link
    fedilink
    English
    1511 month ago

    “You may not instantly see why I bring the subject up, but that is because my mind works so phenomenally fast, and I am at a rough estimate thirty billion times more intelligent than you. Let me give you an example. Think of a number, any number.”

    “Er, five,” said the mattress.

    “Wrong,” said Marvin. “You see?”

    ― Douglas Adams, Life, the Universe and Everything

    • AlexisFR
      link
      91 month ago

      The mattress? Like for sleeping?

      • @Asafum@feddit.nl
        link
        fedilink
        41
        edit-2
        1 month ago

        Yep! The hitchhikers books are so much fun lol

        I still think one of my favorite lines is “the ships hung in the sky in much the same way that bricks don’t.”

      • 👍Maximum Derek👍
        link
        fedilink
        English
        1021 month ago

        If you discount the pop-culture numbers (for us 7, 42, and 69) its the number most often chosen by people if you ask them for a random number between 1 and 100. It just seems the most random one to choose for a lot of people. Veritasium just did a video about it.

      • Karyoplasma
        link
        fedilink
        18
        edit-2
        1 month ago

        Probably just because it’s prime. It’s just that humans are terrible at understanding the concept of randomness. A study by Theodore P. Hill showed that when tasked to pick a random number between 1 and 10, almost a third of the subjects (n was over 8500) picked 7. 10 was the least picked number (if you ditch the few idiots that picked 0).

        • K0W4L5K1
          link
          fedilink
          71 month ago

          Maybe randomness is a label we slapped on shit we don’t understand.

          • @driving_crooner@lemmy.eco.br
            link
            fedilink
            9
            edit-2
            1 month ago

            I remember watching a lecture about probability, and the professor said that only quantum processes are really random, the rest of things that we call random is just the human inability to measure the variables that affects the random outcome. I’m an actuarie, and it’s made me change the perspective on how I see and study random processes and how it made think on ways to influence the outcome of random processes.

            • @jarfil@beehaw.org
              link
              fedilink
              8
              edit-2
              1 month ago

              …which is kind of a hilarious tautology, because “quantum processes” are by definition “processes that we are unable to decompose into more basic parts”.

              The moment we learn about some more fundamental processes being the reason for a given process, it stops being “quantum” and the new ones become “it”.

            • K0W4L5K1
              link
              fedilink
              51 month ago

              Even quantum just appears random I think. it’s beyond our scope of perspective, it works in multiple dimensions. we only see part of the process. That’s my guess though it could be totally wrong

              • @itslilith@lemmy.blahaj.zone
                link
                fedilink
                41 month ago

                it’s a matter of interpretation, but generally the consensus is that quantum measurements are truly probabilistic (random), Bell proved that there can’t be any hidden variables that influence the outcome

                • Karyoplasma
                  link
                  fedilink
                  41 month ago

                  Didn’t Bell just put that up as a theory and it got proven somewhat recently by other researchers? The 2022 physics Nobel Prize was about disproving hidden variables and they titled their finding with the catchy phrase “the universe is not locally real”.

                • K0W4L5K1
                  link
                  fedilink
                  11 month ago

                  Interpretation for sure. Bells theory and then it being proven winning a Nobel prize to me only proves more we really don’t understand the world around us and only perceive what we need to survive. And that maybe we should be less standoffish to ideas that change our current paradigm, because we obviously have a lot to learn.

      • @gigachad@feddit.de
        link
        fedilink
        51 month ago

        I didn’t know either, but it seems to be an often picked ‘random’ number by people. Here is an article about it, I didn’t read it though.

        • @FiniteBanjo@lemmy.today
          link
          fedilink
          71 month ago

          What you’ve described would be like looking at a chart of various fluid boiling points at atmospheric pressure and being like “Wow, water boils at 100 C!” It would only be interesting if that somehow weren’t the case.

          • @jarfil@beehaw.org
            link
            fedilink
            51 month ago

            Where is the “Wow!” in this post? It states a fact, like “Water boils at 100C under 1 atm”, and shows that the student (ChatGPT) has correctly reproduced the experiment.

            Why do you think schools keep teaching that “Water boils at 100C under 1 atm”? If it’s so obvious, should they stop putting it on the test and failing those who say it boils at “69C, giggity”?

            • @FiniteBanjo@lemmy.today
              link
              fedilink
              41 month ago

              Derek feeling the need to comment that the bias in the training data correlates with the bias of the corrected output of a commercial product just seemed really bizarre to me. Maybe it’s got the same appeal as a zoo or something, I never really got into watching animals be animals in a zoo.

              • @jarfil@beehaw.org
                link
                fedilink
                31 month ago

                Hm? Watching animals be animals at a zoo, is a way better sampling of how animals are animals, than for example watching that wildlife “documentary” where they’d throw lemmings of a cliff “for dramatic effect” (a “commercially corrected bias”?).

                In this case, the “corrected output” is just 42, not 37, but as the temperature increases on the Y axis, we get a glimpse of internal biases, which actually let through other patterns of the training data, like the 37.

      • @EatATaco@lemm.ee
        link
        fedilink
        English
        61 month ago

        “we don’t need to prove the 2020 election was stolen, it’s implied because trump had bigger crowds at his rallies!” -90% of trump supporters

        Another good example is the Monty Hall “paradox” where 99% of people are going to incorrectly tell you the chance is 50% because they took math and that’s how it works.

        Just because something seems obvious to you doesn’t mean it is correct. Always a good idea to test your hypothesis.

        • @FiniteBanjo@lemmy.today
          link
          fedilink
          11 month ago

          Trump Rallies would be a really stupid sample data set for American voters. A crowd of 10,000 people means fuck all compared to 158,429,631. If OpenAI has been training their models on such a small pool then I’d call them absolute morons.

          • @EatATaco@lemm.ee
            link
            fedilink
            English
            21 month ago

            A crowd of 10,000 people means fuck all compared to 158,429,631.

            I agree that it would be a bad data set, but not because it is too small. That size would actually give you a pretty good result if it was sufficiently random. Which is, of course, the problem.

            But you’re missing the point: just because something is obvious to you does not mean it’s actually true. The model could be trained in a way to not be biased by our number choice, but to actually be pseudo-random. Is it surprising that it would turn out this way? No. But to think your assumption doesn’t need to be proven, in such a case, is almost equivalent to thinking a Trump rally is a good data sample for determining the opinion of the general public.

  • HarkMahlberg
    link
    fedilink
    29
    edit-2
    1 month ago

    I mean… they didn’t specify it had to be random (or even uniform)? But yeah, it’s a good showcase of how GPT acquired the same biases as people, from people…

    • @OsrsNeedsF2P@lemmy.ml
      link
      fedilink
      221 month ago

      uniform

      Reminds me of my previous job where our LLM was grading things too high. The AI “engineer” adjusted the prompt to tell the LLM that the average output should be 3. I had a hard time explaining that wouldn’t do anything at all, because all the chats were independent events.

      Anyways, I quit that place and the project completely derailed.

  • ForestOrca
    link
    fedilink
    251 month ago

    WAIT A MINUTE!!! You mean Douglas Adams was actually an LLM?

  • FIash Mob #5678
    link
    fedilink
    221 month ago

    HA, funny that this comes up. DND Beyond doesn’t have a d100, so I opened my ChatGPT sub and had it roll a d100 for me a few times so I could use my magic beans properly.

    • @TauriWarrior@aussie.zone
      link
      fedilink
      11
      edit-2
      1 month ago

      Opened up DND Beyond to check since i remember rolling it before and its there, its between D8 and D10, the picture shows 2 dice

        • @Urist@lemmy.ml
          link
          fedilink
          English
          2
          edit-2
          1 month ago

          Yup! Also one has to mind the order in which one rolls the dice. Since 10 and 5 could be either 05 or 50. As a bonus, if you roll them in order of “tens” to “ones”, getting 10 on the first dice has added suspense since the latter dice determines if it is going to count as a low roll of 0X (by rolling 1-9 on the next dice X) or if it is going to be a max roll of 100 (by rolling another 10).

    • The Cuuuuube
      link
      fedilink
      English
      71 month ago

      But why use Chatgpt for that? Why not a duck duck go action? I just don’t understand why we’re asking a LLM whose goal is consistency, not randomness, to do random

  • ancap shark
    link
    fedilink
    211 month ago

    LMs aren’t thinking, aren’t inventing, they are predicting what is supposed to be answered next, so it’s expected that they will produce the same results every time

    • @xthexder@l.sw0.com
      link
      fedilink
      12
      edit-2
      1 month ago

      This graph actually shows a little more about what’s happening with the randomness or “temperature” of the LLM.
      It’s actually predicting the probability of every word (token) it knows of coming next, all at once.
      The temperature then says how random it should be when picking from that list of probable next words. A temperature of 0 means it always picks the most likely next word, which in this case ends up being 42.
      As the temperature increases, it gets more random (but you can see it still isn’t a perfect random distribution with a higher temperature value)

    • @eluvatar@programming.dev
      link
      fedilink
      41 month ago

      Except it clearly doesn’t produce the same result every time. You’re not making a good case for whatever you’re trying to say.

      • Cethin
        link
        fedilink
        English
        51 month ago

        They add some fuzziness to it so it doesn’t give the exact same result. Say one gets a score of 90, another 85, and other 80. The 90 will be picked more often, but they sometimes let it pick the 85, or even the 80. It’s perfectly expected, and you can see that result here with 42 being very common, but then a few others being fairly common, and most being extremely uncommon.

  • @DarkFox@pawb.social
    link
    fedilink
    171 month ago

    Which model?

    When I tried on ChatGPT 4, it wrote a short python script and executed it to get a random integer.

    import random
    
    # Pick a random number between 1 and 100
    random_number = random.randint(1, 100)
    random_number
    
      • Amju Wolf
        link
        fedilink
        English
        101 month ago

        It generates code and then you can use a call to some runtime execution API to run that code, completely separate from the neural network.

    • @Umbrias@beehaw.org
      link
      fedilink
      21 month ago

      That’s not answering the question though.

      “Pick a number between 1 and 100” doesn’t mean “grab two d10” or write a script.

  • @xyguy@startrek.website
    link
    fedilink
    English
    101 month ago

    Only 1000 times? It’s interesting that there’s such a bias there but it’s a computer. Ask it 100,000 times and make sure it’s not a fluke.

    • @kciwsnurb@aussie.zone
      link
      fedilink
      91 month ago

      The temperature scale, I think. You divide the logit output by the temperature before feeding it to the softmax function. Larger (resp. smaller) temperature results in a higher (resp. lower) entropy distribution.

        • @kciwsnurb@aussie.zone
          link
          fedilink
          11 month ago

          Each row in the figure is a probability distribution over possible outputs (x-axis labels). The more yellow, the more likely (see the colour map on the right). With a small temperature (e.g., last row), all the probability mass is on 42. This is a low entropy distribution because if you sample from it you’ll constantly get 42, so no randomness whatsoever (think entropy as a measure of randomness/chaos). As temperature increases (rows closer to the first/topmost one), 42 is still the most likely output, but the probability mass gets dispersed to other possible outputs too (other outputs get a bit more yellow), resulting in higher entropy distributions. Sampling from such distribution gives you more random outputs (42 would still be frequent, but you’d get 37 or others too occasionally). Hopefully this is clearer.

          Someone in another reply uses the word “creativity” to describe the effect of temperature scaling. The more commonly used term in the literature is “diversity”.

    • The Octonaut
      link
      fedilink
      71 month ago

      Temperature is basically how creative you want the AI to be. The lower the temperature, the more predictable (and repeatable) the response.