• Toxuin@lemmy.ca
    link
    fedilink
    arrow-up
    2
    ·
    6 months ago

    It works in reverse too. You can make any LLM “forget” that it is even able to refuse anything.

    • enkers@sh.itjust.works
      link
      fedilink
      arrow-up
      1
      ·
      edit-2
      6 months ago

      Oh for sure, and that was the main point, but I just find LLMs that refuse to do anything at all hilarious.

      I wonder how much work it’d be to use this to jailbreak llama3. I only started playing with local LLMs recently. It’s not exactly a step by step guide, but it gives you all the datasets you need and the general procedure. There’s a bit of “draw then rest of the owl,” but not too much.