Ollama


ulyssestenn/omt: Ollama Model Test - Figure out the best model for the task

💻 Terminal · Content type: Code
github.com · Hacker News

DeskDash - a free Windows tool to easily manage your GGUF files

🏫 Ramkhamhaeng
gerry7.itch.io · r/LocalLLaMA

google/gemma-4-12B-it-qat-q4_0-gguf

🏫 Ramkhamhaeng
huggingface.co
Less-relevant results

Here's a llama.cpp CLI Command builder.

🏫 Ramkhamhaeng

local llm on laptop 780M GPU using llama + gemma 4 qat

🏫 Ramkhamhaeng · Content type: Blog
alper.bearblog.dev

MoQ GGUFs and GSQ: Low-Bit GGUFs Are About to Get Much Better

🏫 Ramkhamhaeng · Content type: News · Content type: Blog

Google AI Edge Gallery launches to macOS

🏫 Ramkhamhaeng
9to5mac.com · r/apple

Gemma 4 12B: A unified, encoder-free multimodal model

📝 how to write · Content type: Discussion

Gemma 4 QAT models: Optimizing model compression for mobile and laptop efficiency

🏫 Ramkhamhaeng · Content type: News · Content type: Blog
blog.google · Hacker News

mtmd : add video input support by ngxson · Pull Request #24269 · ggml-org/llama.cpp

🏫 Ramkhamhaeng · Content type: Code
github.com · r/LocalLLaMA

zaydmulani09/mnemo: Local-first AI memory layer for any LLM. Persistent knowledge graph, entity extraction, semantic retrieval. Works with Ollama, OpenAI, Anthropic, or any OpenAI-compatible backend.

🏫 Ramkhamhaeng · Content type: Code
github.com · Hacker News

Introducing Gemma 4 12B: a unified, encoder-free multimodal model

🎬 Keanu Reeves · Content type: Blog

mtp: support for gemma-4 E2B and E4B assistants by max-krasnyansky · Pull Request #24282 · ggml-org/llama.cpp

🏫 Ramkhamhaeng · Content type: Code
github.com · r/LocalLLaMA

Qwen3.6 + MTP: Calculated context size is smaller when I use `--spec-draft-type-* q4_0`. Is this normal? · ggml-org llama.cpp · Discussion #24102

🏫 Ramkhamhaeng · Content type: Discussion · Content type: Code
github.com · r/LocalLLaMA

llama.cpp - Qwen3.6/3.5-MTP - Share your benchmarks t/s

🏫 Ramkhamhaeng · Content type: Code
github.com · r/LocalLLaMA

model: Granite4 Vision by gabe-l-hart · Pull Request #23545 · ggml-org/llama.cpp

🏫 Ramkhamhaeng · Content type: Code
github.com · r/LocalLLaMA

Does anyone know what PCIe mode was used for these benchmarks?

CLI Tools · Content type: Code
github.com · r/LocalLLaMA

[PoC] server: support requantizing kv cache by wadealexc · Pull Request #24134 · ggml-org/llama.cpp

🏫 Ramkhamhaeng · Content type: Code
github.com · r/LocalLLaMA

ui: Mermaid Diagrams in chat + interactive preview by allozaur · Pull Request #24032 · ggml-org/llama.cpp

🏫 Ramkhamhaeng · Content type: Code
github.com · r/LocalLLaMA

qwen35: use post-norm hidden state for MTP by am17an · Pull Request #24025 · ggml-org/llama.cpp

🏫 Ramkhamhaeng · Content type: Code
github.com · r/LocalLLaMA
