llama.cpp

AI Developer Tools & Infra

A pure C/C++ implementation of LLM inference with no external dependencies. It runs on both CPU and GPU, and its quantization support makes models practical to run on consumer hardware.

GitHub Stars: 113,000

Last Commit: 1h ago

Open Issues: 1,722

Language: C++
