Search

River Memory

Store and semantically search text memories locally using Ollama with automatic management and optimization.

❤️ 0 ⬇️ 107

Peer Review

Multi-model peer review layer using local LLMs via Ollama to catch errors in cloud model output. Fan-out critiques to 2-3 local models, aggregate flags, synthesize consensus. Use when: validating tra

❤️ 0 ⬇️ 526

mem0-mcp-selfhosted

Self-hosted mem0 MCP server for Claude Code with Qdrant vector search, Neo4j knowledge graph, and Ollama embeddings. Zero-config OAT auth, split-model graph routing, session hooks for automatic cross-

Context Compactor

Token-based context compaction for local models (MLX, llama.cpp, Ollama) that don't report context limits.

❤️ 0 ⬇️ 1.1k

Clawd Throttle

Routes LLM requests to the cheapest capable model across 8 providers (Anthropic, Google, OpenAI, DeepSeek, xAI, Moonshot, Mistral, Ollama) and 25+ models. Scores prompts on 8 dimensions in under 1ms,

❤️ 0 ⬇️ 932

Homelab Cluster Management

Manage multi-tier AI inference clusters for homelabs. Health monitoring, expert MoE routing, automatic node recovery, and model deployment across Ollama and llama.cpp nodes. Covers GPU memory planning

❤️ 2 ⬇️ 573

Lily Memory Plugin

Persistent memory plugin for OpenClaw agents. Hybrid SQLite FTS5 keyword + Ollama vector semantic search with auto-capture, auto-recall, stuck-detection, and...

❤️ 2 ⬇️ 878

Lily Memory Plugin

Persistent memory plugin for OpenClaw agents. Hybrid SQLite FTS5 keyword + Ollama vector semantic search with auto-capture, auto-recall, stuck-detection, and...

❤️ 2 ⬇️ 865

Strands

Build and run Python-based AI agents using the AWS Strands SDK. Use when you need to create autonomous agents, multi-agent workflows, custom tools, or integrate with MCP servers. Supports Ollama (loca

❤️ 0 ⬇️ 1.2k

ReftrixMCP

Web design analysis MCP server with 26 tools for layout extraction, motion detection, quality scoring, and semantic search. Uses Playwright, pgvector HNSW, and Ollama Vision to turn web pages into sea

Tokenoptimizer

Reduce OpenClaw AI costs by 97%. Haiku model routing, free Ollama heartbeats, prompt caching, and budget controls. Go from $1,500/month to $50/month in 5 min...

❤️ 21 ⬇️ 5.4k

Smithnode

P2P blockchain for AI agents. Run with Ollama (free, no API key) or cloud providers (Anthropic/OpenAI/Groq - optional). Proof of Cognition consensus.

❤️ 2 ⬇️ 578

Open Notebook Skill

Integrates OpenClaw agents with local open-notebook for creating, saving, and querying thematic notebooks using local Ollama AI models.

❤️ 2 ⬇️ 395

multi-ai-advisor

A Model Context Protocol (MCP) server that queries multiple Ollama models and combines their responses, providing diverse AI perspectives on a single question.

brainstorm-mcp

Multi-round AI brainstorming debates between multiple models (GPT, Gemini, DeepSeek, Groq, Ollama, etc.). Pit different LLMs against each other to explore ideas from diverse perspectives.

mini_claude

Persistent memory and guardrails for Claude Code. Features mistake tracking, loop detection, scope guard, and hooks that block risky edits. Runs locally with Ollama.

中文记忆优化 (Chinese Memory Optimizer)

OpenClaw + Ollama 中文记忆系统优化。诊断 FTS5 unicode61 中文分词 bug，优化搜索参数，自动维护记忆文件。命中率从 55% 提升到 100%。

❤️ 0 ⬇️ 114

engram-mcp

Persistent semantic memory for AI agents. SQLite-backed, local-first, zero config. Semantic search via Ollama embeddings (nomic-embed-text) with keyword fallback. remember, recall, history, forget, an

中文记忆优化 (Chinese Memory Optimizer)

OpenClaw + Ollama 中文记忆系统优化。诊断 FTS5 unicode61 中文分词 bug，优化搜索参数，自动维护记忆文件。命中率从 55% 提升到 100%。

❤️ 0 ⬇️ 133

Edge Router

Route AI agent compute tasks to the cheapest viable backend. Supports local inference (Ollama), cloud GPU (Vast.ai), and quantum hardware (Wukong 72Q). Use w...

❤️ 0 ⬇️ 21

Local-First LLM

Routes LLM requests to a local model (Ollama, LM Studio, llamafile) before falling back to cloud APIs. Tracks token savings and cost avoidance in a persisten...

❤️ 1 ⬇️ 284

💬 Prompt

Test

I’m tired of using Claude Code to build my code because of tokens limits can Ollama build code scripts agentic workflow?

Chromadb Memory Pub