this post was submitted on 19 Aug 2025
Free Open-Source Artificial Intelligence
I have the same GPU and use koboldcpp with the Vulkan backend. It works perfectly fine: I run a 12B model and it's extremely fast, and I could probably fit a bigger model into VRAM. tabbyAPI with EXL2 models didn't work for me; it always generated gibberish (I tried two different models). For context, I'm on Linux, so that may not be an issue on other operating systems.
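
For anyone wanting to sanity-check the "bigger model" guess, here is a rough back-of-the-envelope sketch in Python. Everything in it is an assumption rather than something from the comment above: the function names are made up for illustration, the 4.5 bits per weight is a ballpark for a 4-bit-class quantization, the 2 GiB overhead for context cache and buffers is a placeholder, and the 16 GiB VRAM figure is only an example since the actual GPU isn't named here.

```python
def estimate_weights_gib(params_billion: float, bits_per_weight: float) -> float:
    """Approximate size of the quantized weights alone, in GiB."""
    total_bytes = params_billion * 1e9 * bits_per_weight / 8
    return total_bytes / 2**30


def fits_in_vram(
    params_billion: float,
    bits_per_weight: float,
    vram_gib: float,
    overhead_gib: float = 2.0,  # assumed allowance for KV cache and buffers
) -> bool:
    """Rough check: weights plus a fixed overhead must fit into VRAM."""
    needed = estimate_weights_gib(params_billion, bits_per_weight) + overhead_gib
    return needed <= vram_gib


if __name__ == "__main__":
    # Hypothetical numbers: 12B and 24B models at ~4.5 bits/weight, 16 GiB card.
    for size in (12, 24):
        weights = estimate_weights_gib(size, 4.5)
        ok = fits_in_vram(size, 4.5, vram_gib=16)
        print(f"{size}B: ~{weights:.1f} GiB weights, fits: {ok}")
```

Real usage depends heavily on context length and the backend, so treat the result as a first guess and check actual VRAM use after loading the model.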