L_Acacia@lemmy.one · 11 days ago

Revolt tries to be a discord clone/replacement and suffer from some of the same issues. Matrix happens to have a lot of feature in common, but is focused on privacy and security at its core.

L_Acacia@lemmy.one · 2 months ago

Mistral modèles don’t have much filter don’t worry lmao

L_Acacia@lemmy.one · 2 months ago

They is no chance they are the one training it. It costs hundreds of millions to get a descent model. Seems like they will be using mistral, who have scrapped pretty much 100% of the web to use as training data.

L_Acacia@lemmy.one · 2 months ago

Buying second hand 3090/7090xtx will be cheaper for better performances if you are not building the rest of the machine.

L_Acacia@lemmy.one · 2 months ago

You are limited by bandwidth not compute with llm, so accelerator won’t change the interferance tp/s

L_Acacia@lemmy.one · 3 months ago

I use similar feature on discord quite extensively (custom emote/sticker) and i don’t feel they are just a novelty. Allows us to have inside joke / custom reaction to specific event and I really miss it when trying out open source alternatives.

L_Acacia@lemmy.one · 3 months ago

Too be fair to Gemini, even though it is worse than Claude and Gpt. The weird answer were caused by bad engineering and not by bad model training. They were forcing the incorporattion off the Google search results even though the base model would most likely have gotten it right.

L_Acacia@lemmy.one · 7 months ago

You can take a look at exllama and llama.cpp source code on github if you want to see how it is implemented.

L_Acacia@lemmy.one · 7 months ago

If you have good enough hardware, this is a rabbithole you could explore. https://github.com/oobabooga/text-generation-webui/

L_Acacia@lemmy.one · 7 months ago

Around 48gb of VRAM if you want to run it in 4bits

L_Acacia@lemmy.one · 7 months ago

To run this model locally at gpt4 writing speed you need at least 2 x 3090 or 2 x 7900xtx. VRAM is the limiting factor in 99% of cases for interference. You could try a smaller model like mistral-instruct or SOLAR with your hardware though.

L_Acacia@lemmy.one · 8 months ago

I put zorin on my parent’s computer 2 years ago, while its a great distro, their windows app support is just marketing, its an out of date wine version with an unmaintained launcher. Worse than tinkering with wine yourself.

L_Acacia@lemmy.one · 9 months ago

It does not work exactly like obsidian as it is an outliner. I use both on the same vault and logseq is slower on larger vault.

L_Acacia@lemmy.one · 10 months ago

Do you use comfyui ?

L_Acacia@lemmy.one · 11 months ago

Being able to run benchmarks doesn’t make it is a great experience to use unfortunately. 3/4 of applications don’t run or have bugs that the devs don’t want to fix.

L_Acacia@lemmy.one · 11 months ago

Windows is not fine with ARM, which can be a turnoff for some.

L_Acacia@lemmy.one · edit-2 11 months ago

Llama models tuned for conversation are pretty good at it. ChatGPT also was before getting nerfed a million time.

L_Acacia@lemmy.one · 1 year ago

I think that for most people linux is the most simple OS to use, switched my parents and sister computer to Linux Mint and they don’t ask me to help them with windows changing their browser or moving their icons every two weeks. Though if you are trying to do anything more than web browsing, document editing and listening to music, you will have to learn how some of the os works.

L_Acacia@lemmy.one · 1 year ago

Yes, but it will take some learning time

L_Acacia@lemmy.one · 1 year ago

Llama 2 now uses a license that allows for commercial use.