cpu-only
Here are 12 public repositories matching this topic...
🦙 chat-o-llama: A lightweight, modern web interface for AI conversations with support for both Ollama and llama.cpp backends. Features persistent conversation management, real-time backend switching, intelligent context compression, and a clean responsive UI.
Updated Dec 10, 2025 - Python
An LLM-based content moderator: a Firefox extension that blocks webpages unrelated to work, based on page title and URL. Uses local LLMs via Ollama and LangChain so your browsing history never leaves your device, for complete privacy. Google Gemini is also supported.
Updated Dec 12, 2024 - Python
A high-performance Python library for extracting structured content from PDF documents with layout-aware text extraction. pdf_to_json preserves document structure including headings (H1-H6) and body text, outputting clean JSON format.
Updated Jan 6, 2026 - Python
Image classification with on-device inference, built with Flutter; the AI model runs on the mobile CPU.
Updated Jan 29, 2025 - Dart
Ternsig Virtual Mainframe Runtime (TVMR) — extensible VM with 10 standard extensions (121 instructions), Signal ISA, mastery learning, hot-reload firmware, and thermogram persistence.
Updated Feb 3, 2026 - Rust
CPU-friendly experience-based reasoning framework combining meta-learning (MAML), state space models (SSM), and memory buffers for fast few-shot adaptation. Pure NumPy implementation for edge devices and low-compute environments.
Updated Oct 23, 2025 - Python
A new one-shot face-swap approach for image and video domains, in a version tailored to run on CPU.
Updated Aug 20, 2024 - Python
CPU-optimized RAG pipeline reducing latency 2.7× (247 ms → 92 ms). Implements caching, filtering, and quantization for production use. Complete with FastAPI service, Docker setup, benchmarks, and investor materials.
Updated Jan 24, 2026 - Python
Face detection service with very fast inference using a nano model.
Updated Jan 26, 2025 - Python
A lightweight reproduction and analysis inspired by recent work on presentation-aware deepfake / spoofing detection, with a focus on codec-induced presentation mismatch (AMR) under CPU-only constraints.
Updated Feb 3, 2026 - Python
Chat-O-Llama is a user-friendly web interface for managing conversations with Ollama, featuring persistent chat history. Easily set up and start your chat sessions with just a few commands. 🐙💻
Updated Feb 4, 2026 - HTML