Hello GPT-4o

Dyf_Tfh@lemmy.sdf.org · 6 months ago

Hello GPT-4o

Dran@lemmy.world · edit-2 6 months ago

I have this running at home on a used r630 (CPU only). oobabooga/automatic1111 for LLM/SD backends, vosk + mimic3 for tts/stt. A little bit of custom python to tie it all together. I certainly don’t have latency as low as theirs, but it’s definitely conversational when my sentences are short enough.

Sabata11792@kbin.social · edit-2 6 months ago

Check out the vladmandic fork of auto1111. It seems to be much quicker with new model support.

Been wanting to try voice cloning and totally not cobble together a DIY Ai wiafu.