Self hosted LLM

HumanPerson@sh.itjust.works · 7 months ago

Self hosted LLM

slacktoid@lemmy.ml · 7 months ago

Ollama is a nice server base, they lots of projects that plug on top of that.

grilledcheesecowboy@kbin.social · edit-2 7 months ago

I’ve had pretty good luck running llamafile on my laptop. The speeds aren’t super fast, and I can only use the models that are Mistral 7B and smaller, but the results are good enough for casual use and general R and Python code.

Edit: my laptop doesn’t have a dedicated GPU, and I don’t think llamafile has support for Intel GPUs yet. CPU inference is still pretty quick.

Paragone@lemmy.world · 7 months ago

Thanks to this post, and the other comments in here, I’ve discovered that the ultimate ui for ai-models may well be

https://github.com/ParisNeo/lollms-webui

and on HuggingFace ( that name is aweful: to me it is the creepy-horrible FaceHugger, from the movie Alien, that I saw so many decades ago ) TheBloke has some models which are smaller

https://huggingface.co/TheBloke/

so you can choose a model that will actually-work on your hardware.

I think Llama-2 for brainstorming & CodeLlama-instruct for learning programming examples seems to be the cleanest pair, from what I’ve read, and he’s got GGUF versions with different quantizations, so you can choose what will actually-fit on your hardware.

There are other models on huggingface which seem very useful, like

whisper-large-v3 for speech-to-text,
whisperspeech for text-to-speech,
sdxl-turbo for image-making ( for some copyright-free subjects to practice drawing with ), and so-on…

Some models require GPU, not all.

Damn things moved fast!

k_rol@lemmy.ca · 7 months ago

Did you try LM studio?

Possibly linux@lemmy.zip · 7 months ago

Its proprietary

Zelyios@lemmy.world · 7 months ago

OP should try H2OGPT, it is somewhat technical but the UI makes it easy to configure. You can select many models and prompt types, and you can even input your own documents so that the AI uses them to answer

GoogleyWoog@lemmy.ml · edit-2 7 months ago

deleted by creator