AiFinder All Tools llama.cpp

Pure C/C++ implementation of LLM inference. No dependencies, runs on CPU and GPU with quantization support for consumer hardware.

113,000
GitHub Stars
1h ago
Last Commit
1722
Open Issues
C++
Language

Browse categories