ollama-setup

Featured

Auto-configure Ollama when the user needs local LLM deployment, wants free AI alternatives, or wants to eliminate hosted API costs. Trigger phrases: "install ollama", "local AI", "free LLM", "self-hosted AI", "replace OpenAI", "no API costs". Use when the conversation context matches one of these needs.

AI & Automation · 2,274 stars · 319 forks · Updated today · MIT

Quality Score: 99/100

Stars 20%: 100
Recency 20%: 100
Frontmatter 20%
Documentation 15%: 100
Issue Health 10%
License 10%: 100
Description 5%: 100

Skill Content

# Ollama Setup

## Overview

Auto-configure Ollama for local LLM deployment, eliminating hosted API costs and enabling offline AI inference. This skill handles system assessment, model selection based on available hardware (RAM, GPU), installation across macOS/Linux/Docker, and integration with Python, Node.js, and REST API clients.

## Prerequisites

- macOS 12+, Linux (Ubuntu 20.04+, Fedora 36+), or Docker runtime
- Minimum 8 GB RAM for 7B parameter models; 16 GB for 13B models; 32 GB+ for 70B models
- Optional: NVIDIA GPU with CUDA drivers for accelerated inference (`nvidia-smi` to verify)
- Optional: Apple Silicon (M1/M2/M3) for Metal-accelerated inference on macOS
- Disk space: 4-40 GB depending on model size (quantized weights)
- Package manager: `brew` (macOS), `curl` (Linux), or `docker` (containerized)

## Instructions

1. Detect the host operating system and available hardware using `uname -s`, `free -h` (Linux) or `vm_stat` (macOS), and `nvidia-smi` (if GPU present)
2. Select appropriate models based on available RAM:
   - **8 GB**: llama3.2:7b (4 GB), mistral:7b (4 GB), phi3:14b (8 GB)
   - **16 GB**: codellama:13b (7 GB), mixtral:8x7b (26 GB quantized)
   - **32 GB+**: llama3.2:70b (40 GB), codellama:34b (20 GB)
3. Install Ollama using the platform-appropriate method:
   - macOS: `brew install ollama && brew services start ollama`
   - Linux: `curl -fsSL https://ollama.com/install.sh | sh && sudo systemctl start ollama`
   - Docker: `docker run -d -v ollama:/root...
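
The RAM-based model selection and platform-specific install choice described in the instructions can be sketched in Python as a lookup. This is a minimal illustration, not the skill's actual implementation: `MODEL_TIERS`, `recommend_models`, and `install_method` are hypothetical names, the model tags are copied as listed in the instructions without being checked against the Ollama model library, and falling back to Docker on any other operating system is an assumption.

```python
import platform

# RAM tiers and model tags copied from the instructions as written.
# They have not been verified against the Ollama model library.
MODEL_TIERS = [
    (32, ["llama3.2:70b", "codellama:34b"]),
    (16, ["codellama:13b", "mixtral:8x7b"]),
    (8, ["llama3.2:7b", "mistral:7b", "phi3:14b"]),
]


def recommend_models(ram_gb):
    """Return the model tags of the highest RAM tier that ram_gb satisfies."""
    for min_ram_gb, models in MODEL_TIERS:
        if ram_gb >= min_ram_gb:
            return list(models)
    return []  # below 8 GB, none of the listed tiers applies


def install_method(system):
    """Map a platform.system() value to an install route.

    "Darwin" -> brew and "Linux" -> curl follow the instructions;
    using Docker for anything else is an assumption of this sketch.
    """
    return {"Darwin": "brew", "Linux": "curl"}.get(system, "docker")


if __name__ == "__main__":
    # platform.system() is the Python counterpart of `uname -s`.
    print(install_method(platform.system()))
    print(recommend_models(16))
```

Calling `recommend_models(16)` returns only the 16 GB tier; a variant that also includes the smaller tiers would be an equally reasonable design.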

Details

Author: jeremylongshore
Repository: jeremylongshore/claude-code-plugins-plus-skills
Created: 7 months ago
Last Updated: today
Language: Python
License: MIT

Integrates with

OpenAI (AI) · Anthropic (AI) · Ollama (AI) · Docker (Infrastructure) · REST API (API)

Similar Skills

Semantically similar based on skill content — not just same category

AI & Automation Solid

add-ollama-tool

Add Ollama MCP server so the container agent can call local models and optionally manage the Ollama model library.

29,591 Updated today

qwibitai

AI & Automation Solid

add-ollama-tool

Add Ollama MCP server so the container agent can call local models and optionally manage the Ollama model library.

29,591 Updated today

nanocoai

AI & Automation Listed

llama-cpp

Secondary local LLM inference engine via llama.cpp. This skill should be used when running GGUF models directly, loading LoRA adapters for Kothar, benchmarking inference speed, or serving models via llama-server. Includes dedicated Qwen 3.5 serve scripts (9B dense with F16 option, 35B MoE) with asymmetric KV cache and thinking mode. Complements Ollama (which remains primary for RLAMA and general use).

33 Updated 2 days ago

tdimino

AI & Automation Featured

llama-cpp

Runs LLM inference on CPU, Apple Silicon, and consumer GPUs without NVIDIA hardware. Use for edge deployment, M1/M2/M3 Macs, AMD/Intel GPUs, or when CUDA is unavailable. Supports GGUF quantization (1.5-8 bit) for reduced memory and 4-10× speedup vs PyTorch on CPU.

27,705 Updated today

davila7

AI & Automation Solid

llama-cpp

175,435 Updated today

NousResearch