• WildPalmTree@lemmy.world
    link
    fedilink
    English
    arrow-up
    7
    ·
    8 days ago

    ELI5 1-bit module. With three attempts, i got nothing out of it, so I assume it’s a simpler, more energy efficient model.

    • icecreamtaco@lemmy.world
      link
      fedilink
      English
      arrow-up
      5
      ·
      edit-2
      8 days ago

      It’s a massive performance upgrade, which would make current sized models better and tiny phone-sized models viable. Only problem is that models need to be retrained to use it and afaik, no one significant has done it yet.

    • thickertoofan@lemm.eeOP
      link
      fedilink
      English
      arrow-up
      4
      ·
      edit-2
      8 days ago

      i’m not the smartest out there to explain it but it’s like …instead of floating point numbers as the weights, its just -1,0,1.