Safety and Research were Sacrificed for Profit under Altman

sculd@beehaw.org · 1 year ago

Safety and Research were Sacrificed for Profit under Altman

RandoCalrandian@kbin.social · 1 year ago

Pulled up a self hosted option last week to try it out. It’s not gpt4 level, but it’s damn close and I don’t worry giving access to my local documents

PrivateGPT for anyone interested

cwagner@beehaw.org · 1 year ago

That’s an interface for models. Which model did you use?

RandoCalrandian@kbin.social · 1 year ago

Mistral-7B-Instruct-v0.1 is the default, i’m downloading the Llama2 model to test it with now, but many models on HuggingFace should still work

cwagner@beehaw.org · 1 year ago

I do not believe any 7B model comes even close to 3.5 in quality. I used LLama V1 64B, and it was horrible in comparison. Are you really telling me that this tiny model gives better general answers? Or am I just misunderstanding what you are saying?

RandoCalrandian@kbin.social · 1 year ago

I didn’t say better, I said comparable
And faster, without handing over my data and conversations for monetization

Given the locally hosted benefits, and the ability to go to chatgpt for any answer minstrel gives that doesn’t satisfy you, makes it strong competition to chatgpt as the default tool

Hosting it yourself also means you can swap llm’s out based on context and what they’re trained on. Highly tuned models perform better than chatgpt at the things they are meant to excel in.

cwagner@beehaw.org · edit-2 1 year ago

Prompt:

I’m currently trying to show on the Website Beehaw, that certain LLMs are far superior in writing than others. Examples of what bigger models do better than smaller ones: *

Mistral-7B-Instruct-v0.1

ntire articles* vs Headlines Descriptions vs Product titles *Bul

GPT 3.5-Turbo doesn’t support completion as it’s for chat, so I used an even worse one, text-davinci-003 which is far behind state of the art.

Bigger models are able to handle more complex and detailed tasks with ease

Bigger models are better suited for natural language understanding and text processing

Bigger models are able to learn more accurate representations of context, thus improving the precision of the output

Bigger models can process data more quickly and efficiently, saving time and processing power when large volumes of data are used

Bigger models can better recognize more subtle nuances in language, which allows them to produce more accurate results

Bigger models are able to use more sophisticated algorithms, resulting in a more comprehensive and deeper understanding of the data being used

Mistral 7B might be okay for some very specific cases, but it’s not comparable to proper models at all.

edit: gave it a second chance, it’s a bit better (at least no complete nonsense anymore), but still terrible writing and doesn’t make much sense

Paraphrasing The ability of a language model to generate text that has a similar meaning to the original text is called paraphrasing. This is a very common problem in natural language processing, and many LLMs are designed to be able to paraphrase text. However, there are some LLMs that are particularly good at paraphrasing, and these models are often preferred over smaller models because of their ability to generate more varied and unique text. Examples of LLMs that are known for their paraphrasing abilities include GPT-2 and transformers. These models