What can I use for an offline, selfhosted LLM client, pref with images,charts, python code execution

catty@lemmy.world · edit-2 1 month ago

What can I use for an offline, selfhosted LLM client, pref with images,charts, python code execution

Mitex leo@buddyverse.one · 30 days ago

You should try https://cherry-ai.com/ … It’s the most advanced client out there. I personally use Ollama for running the models and Mistral API for advnaced tasks.

catty@lemmy.world · 29 days ago

But its website is Chinese. Also what’s the github?

happinessattack@lemmy.world · 29 days ago

https://github.com/CherryHQ/cherry-studio

Mitex leo@buddyverse.one · 30 days ago

It’s fully open source and free (as in beer).

Mitex leo@buddyverse.one · 30 days ago

You should try https://cherry-ai.com/ … It’s the most advanced client out there. I personally use Ollama for running the models and Mistral API for advnaced tasks.

Björn Tantau@swg-empire.de · 1 month ago

!localllama@sh.itjust.works

ViatorOmnium@piefed.social · 1 month ago

The main limitation is the VRAM, but I doubt any model is going to be particularly fast.

I think phi3:mini on ollama might be an okish fit for python, since it’s a small model, but was trained on python codebases.

catty@lemmy.world · 1 month ago

I’m getting very-near real-time on my old laptop. Maybe a delay of 1-2s whilst it creates the response

hendrik@palaver.p3x.de · 1 month ago

Maybe LocalAI? It doesn’t do python code execution, but pretty much all of the rest.

catty@lemmy.world · 1 month ago

This looks interesting - do you have experience of it? How reliable / efficient is it?

breadsmasher@lemmy.world · 1 month ago

AUTOMATIC1111?

https://github.com/AUTOMATIC1111/stable-diffusion-webui

TMP_NKcYUEoM7kXg4qYe@lemmy.world · 1 month ago

You can tell Open Interpreter to run commands based on you human-language input. If you want local only LLM, you can pair it with Ollama. It works for “interactive” use where you’re asked for confirmation before a command is run.

I set this up in a VM because I wanted a full automatic coding “agent” which can run commands without my intervention and I did not want it to blow up main system. It did not really work though because as far as I know Open Interpreter does not have a way to “pipe” a command’s output back into the LLM so that it could create feedback with linters and stuff.

Another issue was that Starcoder2, which is the only LLM trained on permissive licensed code I could find, only has a 15B “human-like” model. The smaller models only speak code so I don’t know how that would work for agentic usage and the 15B is really slow running on DDR4 CPU. I think agents are cool though so I would like to try Aider which is a supposedly good open source agent and unlike Open Interpreter is not abandonware.

Thanks for coming to my blabering talk, hope this might be useful for someone.

andrew0@lemmy.dbzer0.com · 1 month ago

Ollama for API, which you can integrate into Open WebUI. You can also integrate image generation with ComfyUI I believe.

It’s less of a hassle to use Docker for Open WebUI, but ollama works as a regular CLI tool.

O_R_I_O_N@lemm.ee · edit-2 1 month ago

ChainLit is a super ez UI too. Ollama works well with Semantic Kernal (for integration with existing code) and langChain (for agent orchestration). I’m working on building MCP interaction with ComfyUI’s API, it’s a pain in the ass.

30 days ago

This is what I do its excellent.

catty@lemmy.world · edit-2 29 days ago

But won’t this be a mish-mash of different docker containers and projects creating an installation, dependency, upgrade nightmare?

andrew0@lemmy.dbzer0.com · 29 days ago

All the ones I mentioned can be installed with pip or uv if I am not mistaken. It would probably be more finicky than containers that you can put behind a reverse proxy, but it is possible if you wish to go that route. Ollama will also run system-wide, so any project will be able to use its API without you having to create a separate environment and download the same model twice in order to use it.

just_another_person@lemmy.world · 1 month ago

Would like fries or a jetpack with that?

catty@lemmy.world · 1 month ago

I’ve discovered jan.ai which is far faster than GPT4All, and visually a little nicer

voidspace@lemmy.world · 1 month ago

Took ages to produce answer, and only worked once on one model, then crashed since then.

otacon239@lemmy.world · 1 month ago

I also started using this recently and it’s very plot and play. Just open and run. It’s the only client so far that feels like I could recommend to non-geeks.